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Preface and Acknowledgments 


Preface 


A number of my committee members have told me that my writing is not quite like a 
traditional doctoral students — both for good and for ill. I suppose this comes from 
the fact that I identify as a fiction writer as much as, or more than, 1 see myself as a 
scholar. Given this, l am thankful for the ways that my committee has reigned in my 
unconscious gravitation toward tools of fiction (e.g., suspense) that are inappropriate 
in the scholarly domain, while still allowing me to structure my prose toward what I 
hope is a pleasing flow of language and argument. The result, in its best moments, 
can perhaps be seen as a productive hybrid between traditional dissertation writing 
and my personal writing inclinations. 

However, despite the best efforts of my committee, I have not been able to com- 
pletely move away from my tendency toward the “slow reveal.” In the spirit of 
ameliorating this, let me list here — at the very outset — some of the things I hope 


are provided by this study: 


e Á set of example interpretations of expressive processes. These range from the 


first work of digital literature (Christopher Strachey's love letter generator) 


lv 


through a set of historical examples (e.g., Tristan Tzara's newspaper poem, 
the Oulipo's “N + 7”) and culminate in a critical survey of the field of story 
generation. In each case the specific operations of the processes are examined, 
compared with the operations of related processes, and considered within the 


context of their initial creation and reception. 


A clearer set of concepts for discussing the internals of digital literature and 
digital media. Beginning with the simple distinction between surface, data, and 
process — and then building up to a more fine-grained model of process-oriented 
contributions described with these terms: implemented processes, abstract pro- 


cesses, operational logics, and works of process. 


A sense of what is brought into focus, and what is marginalized, when com- 
paring processes. Specifically, comparisons of processes carried out by authors, 
audiences, and automatic computation — and, within this, the differing forms 


of indeterminacy as viewed from each perspective. 


A consideration of some of the culture of computation, from the perspective of 
digital literature. By taking the forms of literature, and by being clearly au- 
thored artifacts, works of digital literature can provide legible examples of ideas 
in circulation in the wider fields of computation. This study examines works 
that reflect “big dreams” such as automatic authorship, simulating human be- 
havior, and a computational universal language. It also uses digital literature, 
and particularly fiction, as a focusing mechanism for looking at the broad cur- 


rents in fields such as artificial intelligence and natural language processing. 


Finally, its initial chapters use digital cultural objects to drive a clarifying dis- 


cussion of what we might mean by terms such as “digital” in the first place. 


It is my hope that the coming pages provide the above to readers both convincingly 


and pleasurably. 
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Introduction 


Expressive Processing 


In 1996 poet, programmer, and theorist John Cayley wrote that the innovative pro- 
cesses being developed by literary practitioners introduce “a new element into the 
critical understanding and assessment of new literary objects. We must begin to 
make judgements about the composition of their structure — to assess, for example, 
the structural design or composition of the procedures which generate literary objects 
— not only the objects themselves” (p. 168). 

It has been a decade since Cayley wrote these words. Yet, today, we find ourselves 
in much the same position. We are aware that procedures are central to the creation 
of digital works, but we are only beginning to consider how we might interpret such 
processes themselves, rather than simply their outputs. This is true in the field of 
digital literature, the wider field of the digital arts, and in the general consideration 
of digital cultural objects: from computer games to websites. 


The goal of this study is to begin this work in earnest. 


Fixed media and process-oriented work 


We return to our favorite books and films because they bring us something new each 
time we experience them. However, it is important to understand that these media 
bring us something new because we — their audiences — experience them anew. 
What they show us, on the other hand, is the same each time. These media are 
essentially fixed, stable — saying the same words and showing the same images, in 
the same order, at each encounter. 

Authors such as Cayley, on the other hand, direct our attention to the fact that 
we are increasingly experiencing media that not only say things and show things — 
but also operate. These media have internally-defined procedures that allow them 
to respond to their audiences, recombine their elements, and transform in ways that 
result in many different possibilities. These human-designed processes separate such 
media from fixed media, which have only one possible configuration. 

Of course, we have had reconfigurable and process-oriented works for a long time. 
Some of these works are operated by their audiences. Others are composed through 
processes carried out by their authors. The change, now, is the large number of works 
driven by processes that are defined within the works themselves and that are carried 
out automatically by digital computation. For the next few pages l will refer to such 


» 


works as “digital works,” while using the term “fixed media” to describe traditional 


media such as most books and films.! It is my belief that we will not be able to 





¡Some may object that, in using the term “fixed media,” I am eliding important facts: that even 
a traditional a book can be read in many ways, that a film is experienced in a time-based manner, 
and so on. This is not my intention. Rather, 1 mean to point to other aspects of these media: A 
book's words do not change from reading to reading. A film's images are shown in the same order, 


fully engage digital works until we consider not only how we experience them but 
also how they operate. Our cultural objects are undergoing a profound shift, and our 
approaches to understanding them — as creators, as critics, and as audience members 


— must expand as a result. 


Understanding systems 


In some ways we are faced with a truly dramatic transformation. Our media objects 
have been fixed for nearly our entire history as a species. Cave paintings are fixed, 
tablets bearing the Code of Hammurabi are fixed, a Gutenberg Bible is fixed — and 
so is a vinyl disk of Kind of Blue, a celluloid print of Metropolis, or a digital broadcast 
of Battlestar Galactica. From the perspective of media — one of the fundamental 
ways that we define and reproduce our cultures — the move to media objects that 
carry out processes is a profound shift. 

On the other hand, the challenge of understanding systems, and the processes 
that drive them, is not new. In fact, this challenge has been with us longer than 
we have had media. Consider, for example, the human body. We take in food and 
produce waste. We fall ill and recover from illness. We conceive children, give birth 
to them, and watch them develop into adults. All of these are results of the fact 
that the human body is a complex system driven by processes. For most of human 
history our ability to understand these systems has been limited, but also essential 
to our survival. While we could not uncover the mechanisms of these processes, we 


could observe correlations that gave us some insight. Some foods interacted with our 





with the same timing, at each viewing. 


digestive processes better than others, and cooking some foods made them a better 
fit with our processes. Some treatments seemed to help people recover from illness, 
while others did not. 

But correlational data, from observation and experiment, only took us so far. Now, 
from the relatively recent advent of approaches such as microbiology, our species has 
begun pushing a new frontier: actually understanding the operations of our bodily 
systems, so that we can see the causes of phenomena. This not only changes how 
we understand our bodies, but also offers new opportunities for intervention in its 
systems, and these systems” interactions with illnesses. 

Now, let us return to considering digital works, which also have internal processes. 
In interpreting digital works, the challenge we face is to find appropriate ways to bring 
our basic approaches for understanding dynamic systems — like the body — to bear 


on something which has always, previously, been fixed: our media. 


Digital systems 


Of course, the new forms of process-oriented media that have emerged over recent 
decades are not quite like our bodies. There are a number of important differences. 
The most profound, for our purposes, is that media such as computer games and 
websites are human-authored systems. While we are still puzzling over the operations 
of the body”s systems (e.g., gene expression) we know how these digital systems are 
composed. Human authors have defined the data files (images, sound, text), written 
the software, and designed and built each aspect of the hardware — from processor 


chips to storage devices, from display screens to input devices. 


Our challenge, then, with digital works, is not to try to unlock the mysteries of 
how these systems operate. We know how they operate, at a general level, because we 
created each aspect of the technology. So, to which of the human-authored elements 
of the system will we turn our attention, and what forms will our attention take? 

When looking at a novel such as Virginia Woolf*s To the Lighthouse we have many 
options. We can look at the book itself, in its different editions, and perhaps how 
these connect with histories of publishing. We can look at wider literary constellations 
within which To the Lighthouse is one point: perhaps feminist novels, or war novels, 
or novels that contain autobiographical elements. We could focus on Woolf as an 
individual, and the novel as one element of her life. But most of Woolf's fellow 
writers, along with most critics and other audience members, will — even if they 
make connections with these other ways of considering the novel — turn much of 
their attention to the words of Woolf's book. It is the text where most of Woolf"s 
authoring effort was focused, and it turns out to also be the element that grounds 
most of our interpretive attention. 

Faced with the processes of digital works, we must again answer the question of 
where to turn our attention. Of course, as with other media forms, we will benefit 
from taking a wide variety of approaches. But I believe that, as with Woolf”s words, 
we will find one element central. 

If we were to take a survey of writings about digital work by those in the arts and 
humanities thus far, and attempt to determine what aspect of digital systems is most 
interesting, we would first find that detailed commentaries on system operations are 


thus far extremely rare. What we have, instead, are mostly passing comments about 


the processes of digital systems, with most attention focused on the aspects of works 
that are more like traditional, fixed media. If we were to accept this limitation, and 
look at what aspect is most mentioned in these passing comments, it is likely that 
the answer would be “binary numbers.” We find a virtual sea of comments about 
the binary, zeros and ones, bits and bytes — the manipulation of such numbers 
by processors, their storage in memory, their transmission across networks, and the 
supposedly reductive and /or liberating encoding of all data in this form. But I believe 
such comments give much more attention to binary mathematics than it deserves. 

The most obvious alternative to a focus on binaries — one that has long had 
attention among computer scientists, and is gaining attention among artists and 
scholars — is a focus on “code.” Here, much as with Woolf”s words, attention is on 
the place where the effort of the author is focused. A digital system's operations are 
defined by the painstaking crafting of computer programs, the authoring of of code. 
Computer science has a tradition of viewing such code aesthetically, citing virtues 
such as economy of expression. For traditional humanists a focus on code promises 
the opportunity to employ familiar critical tools in a new context, where reading code 
becomes much like reading any other expressive text. But I believe this, too, while 
closer to the mark, does not identify the element of digital systems that will prove 
most important for our attention. 

Code is attractive to traditional critics in part because it is, like traditional media, 
fixed. But our fundamental challenge is to begin to understand dynamic media sys- 
tems. In other words, I believe we need to focus on what code is used to express and 


construct: the operations of systems. The examination of code is only one potential 


window through which we can manage to view these operations. Binary mathemat- 
ics are even less central, forming a nearly incidental support for the execution of 
computational processes. 

Learning to interpret media systems based on their operations will not replace a 
focus on audience experiences of media, just as the study of microbiology does not 
replace an understanding of gross anatomy. Rather, it will complement and expand 
audience-focused understandings. We will move beyond understanding works that 
embody digital processes, that are dynamic systems, through frameworks developed 
for fixed media. We will be able to situate, critique, and learn from how these media 
operate, not just how they appear from a typical experience. 

Meeting this challenge will require new perspectives and new vocabularies. It 
will also require new histories, as earlier work is reconsidered in light of our new ap- 


proaches. This study attempts to make contributions to these interconnected efforts. 


Focusing on Process 


The examples of digital work cited above (computer games and websites) are of a 
type that I will call “digital media.” More work on definitions will take place in a 
few pages — but for now let us say that digital media is “media for which digital 


computation is required at the time of audience experience.”? 





2Others use the term “digital media” differently. For example, in popular use the term can 
encompass much that 1 have referred to as “fixed media” — including fixed music distributed on 
compact disks, fixed video distributed on DVDs, and so on. This meaning can be seen in legislative 
uses, such as the “Digital Media Consumers' Rights Act of 2003,” a bill introduced in the U.S. House 
of Representatives. These types of media do “require” digital computation at the time of audience 


Authors and scholars of digital media — myself included — have mostly focused 
on interactive digital media. There are a number of reasons for this, probably not 
the least of which is that successful interactive works are doing something that non- 
digital media do rarely (and generally with much less complexity). Interactive works 
also clearly require computation in order to be what they are — the computation is 
necessary to support the interaction. But, unfortunately, that is usually about as far 
as the consideration of computational processes goes in writing about digital media. 
A work's processes are considered only to the extent that they are useful in explaining 
the audience's experience. 

This study will reverse the relationship between process and interaction found in 
most studies of digital media. Rather than interaction as the focus, with process 
moved to the periphery, here process will be the focus — but with the audience and 
interaction always present, if not given a central place in most of the discussion. 
Further, the processes considered will be shaped by a particular interest in digital 
cultural production, especially digital literature (and within that on fiction). 


In order to make this clearer, it will be useful to begin with a broader view of 





experience, but in a different sense than l outline later in this chapter. 

Given the potential for confusion from these and other issues, some prominent writers have 
abandoned the term “digital media.” For example, Lev Manovich has tended not to use the word 
“digital.” He writes in The Language of New Media: “Because of this ambiguity, 1 try to avoid using 
the word digital in this book” (2001, 52). Manovich, instead, uses the term “new media.” Janet 
Murray, on the other hand, views “media” as the problematic term, rather than “digital.” In Hamlet 
on the Holodeck (1997) she writes primarily of “digital environments.” Further, in “Inventing the 
Medium” Murray uses “medium” rather than “media,” writing: “[T]he term “new media” is a sign 
of our current confusion... How long will it take before we see the gift for what it is — a single 
new medium of representation, the digital medium, formed by the braided interplay of technical 
invention and cultural expression at the end of the 20th century?” 

Given that there is no consensus in the field, this study will seek to clarify important terms as 
they are introduced. 


Audience(s) 


Outside 
processes 4 
data sources 





Author(s) 


Figure 1.1: A model of digital media 


digital media and other digital works, within which the foci of digital literature and 


particular types of processes will be situated. 


A model of digital media 


Figure 1.1 presents a possible model for thinking about digital media. There are 
seven elements represented within it, and one element so pervasive (context) that it is 
not explicitly represented (but rather parenthetically implied next to every element). 


Here is a brief description of each of the labeled elements: 


Author(s). Those who select and create the work's data and processes. 


Data. The text, graphics, sound files, tables of numbers, and other non-process 


elements of the work. Some may be newly-created for the work, others may 
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pre-exist and be selected by the author(s), and yet others may be collected 
while the work is in operation from dynamic sources that are specified by the 


author(s). 


Process. The operations of the work, including those that respond to interaction, 
arrange data, maintain models, drive simulations, and so on. Some of these may 
be newly-created for the work, others may be selected from a set of available 
options in an authoring system such as Flash. Even newly-created processes, 
of course, are implemented using pre-existing materials (e.g., general-purpose 


programming languages). 


Surface. How the work is accessed from outside. This includes what is made available 
to the audience for interpretation (and interaction) as well as instructions to the 
audience, physical interface devices, and so on. It also includes any application 
programming interfaces or other means by which the work can receive requests 
from outside processes or make such requests — including those that allow for 


the receipt or alteration of data. 


Interaction. This is change to the state of the work, for which the work was designed, 
that comes from outside the work. It is also any change made by the work to 
outside processes or data. Such changes operate through the work's surfaces, 
either that available to the audience or that facing outside processes and data 
sources. Typically, audience interaction involves some action relative the sur- 
face (e.g., typing on a keyboard, touching a screen, moving in front of a video 


camera) and in some cases immediately produces some additional change to the 
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surface resulting from the internal state change (e.g., the typed text appears on 
screen, the touched element on the screen glows blue, the audience member's 
image in a projection mirrors their movements before the camera) but not in all 
cases. When this sort of immediate change takes place it can enable the work 


to act as a conduit for real-time communication between audience members. 


Outside processes and data sources. The work may incorporate data from outside 
sources that — rather than being input manually by the authors — is automat- 
ically requested by the work's processes (e.g., news headlines, weather data). 
Similarly, the work might be designed to make changes to outside data sources 
other than those that contribute to the surface presented to the audience (e.g., 


a transaction database fed by many works). 


Audience(s). The work may be presented to one or more persons, at one or more 


locations, and at one time or repeatedly. 


As stated above, context must be understood to exist as an important part of 
interpreting each of these elements. More specifically, both the work of authoring 
and the work of audience interpretation /interaction take place in social and material 
contexts. Similarly, data and process can only exist in a wider context of computation. 
For example, in the case of web-based work the data and processes defined and 
selected by the author(s) exist in the context of web server software/hardware and web 
browser software/hardware. The context of the web server and browser determine 
which types of data and process are supported — as well as shaping the surface 


interpreted by the audience and the potential for interaction. 
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Figure 1.2: A focus on process, data, and author(s). 


Digitally-authored work 


In order to focus on process, as discussed earlier, this study backgrounds some ele- 
ments of the model in figure 1.1. Specifically, it backgrounds those aspects of digital 
media most discussed by contemporary critics: surface, interaction, and audience. 
Figure 1.2 is an attempt to represent this graphically. The grayed elements have 
not disappeared, but it is clear that our attention turns to process and data, their 
authoring, and the surrounding context. 

When this takes place, the resulting area of focus is still of interest to digital 
media — but it also becomes of interest in understanding other types of digital work. 
Specifically, by backgerounding surface and interaction we are turning away from the 
elements of digital media that require computation at the time of audience experience. 


Process, data, and authoring are also — in addition to their importance for digital 
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media — the primary elements of what this study will call “digitally-authored work.” 
This is work that requires computation before the audience experience, but not during 
16. 

Of course, many of the computational processes that shape our lives are digitally 
authored, rather than being digital media. When we take a trip by airplane, the 
airport shuttle route planning need not be interactive, nor the weather simulations, 
nor the Transportation Safety Authority's search through databases to determine 
who is a suspect. Each is a computationally intense process — but it is a process 
that can be run when needed and produce an output that is interpreted later. 

Digital cultural production — digital art, computer games, etc. — includes both 
digitally authored and digital media work. The same is true of digital literature, which 
is a permeably-bordered subcategory of digital cultural production. An attempt to 
visualize the embedding of digital literature within digital cultural production, as 
well as the combination of digitally authored and digital media work within them, 
appears as figure 1.3. 

In this figure, digital media appears on the left, while digitally-authored work 
appears on the right.2 The focus of most discussions of digital cultural work, then, 
has been on the projects that occupy the left side of the figure. The approach of this 
study, however, might be visualized somewhat differently: beginning with a gradient 
rather than a clear division. This gradient runs between “process-intensive works” 


and “process-light works.” Process intensity is a concept that I borrow from game 





3The symmetry of the two halves of this diagram, as with others in this chapter, is due to the 
diagram's schematic nature, rather than to a belief that these areas of the field are of equal sizes. 
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Digital Media Digitally Authored 


Figure 1.3: Digital literature (dotted oval) is seen as within digital cultural production 
more generally (dashed oval) and shares its characteristic of including both digital 
media (left half, clear) and digitally-authored work (right half, hatched). 


designer and theorist Chris Crawford — whose outlining of the concept, along with 
my particular approach to it, will be discussed in the next chapter. For now, figure 1.4 
represents the process intensity gradient. Process intensity is in the lower part of the 
figure, becoming lighter as we rise, operating along an axis that runs perpendicular 
to the distinction between digital media and digitally-authored work. 

However, while process intensity helps us begin to identify this study's area of 
focus, it is not sufficient on its own. As the next chapter will discuss in more depth, 
the concept of process intensity treats all processing as the same. However, some 
processes carried out by digital works have a much more meaningful influence on 
their operations. 

For example, many computer games allow the audience to change the level of de- 
tail at which their graphics will be rendered. High-quality graphics are very process 


intensive, and may make the gameplay less responsive if the audience member's ma- 
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Digital Media Digitally Authored 


Process Intensive 


Figure 1.4: Process-intensive work (most intense at bottom) forms, with process-light 
work, a gradient that runs across the distinction between digital media (left half) and 
digitally-authored work (right half, hatched). 


chine finds them a challenge to compute. Varying the graphical detail, while it does 
vary the game's process intensity, is not considered a meaningful change to the game 
— so the game is seen as the same with varying levels of graphical process intensity. 

On the other hand, games treat it quite differently when audience members are 
invited to lower the intensity of another process that can be quite demanding: the 
artificial intelligence (Al) that shapes the behavior of non-player characters (NPCs) 
or the computer's play style. Players are able, in some games, to lower the intensity 
of Al processes — but as a way of adjusting the “difficulty level” of the game. That 
is to say, with lower intensity in their Al processes, games are no longer considered 
to offer the same gameplay experience — whereas with lower graphical intensity the 


gameplay experience is considered the same. 
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The focus of this study will be on processes such as Al, which 1 refer to as “ex- 
pressive processes.” These are processes that clearly shape the meaning of works. 

Within this area of focus, two types of expressive processes will be considered: 
mainstream processes embraced as general practices by a field and processes that are 
newly defined for particular works. Mainstream processes reflect the field”s larger 
structure and and influence the shape of projects within it. Newly defined processes 
are of interest for a number of different reasons. For newly-defined processes it is 
often apparent how the processes and data have been designed to work together, the 
context of the process is often easier to identify, and, overall, it is generally clearer 
how the processes operate as part of the work's expression. Figure 1.5 represents 
an attempt to visualize works for which processes are newly-defined — which occur 
throughout the process-intensive and process-light areas, as well as throughout digital 
media and digitally-authored work. Later in this study the interpretation of specific 
works and wider practices will intertwine in the considerations of story generation 
and natural language generation. 

As the mention of these two types of generation suggests, this study's focus on 
process in digital literature — putting aside interaction — means that our attention 
will be drawn to computationally intense, but non-interactive, forms of digital liter- 
ature. Particularly, this study will focus on processes that are of interest from the 
perspective of fiction. Within this, poetry and other forms of experimental writing 
will also be discussed. 

In fact, a number of examples discussed in this dissertation are not digital in any 


way — but rather forms of writing for which processes are important, but where 
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Figure 1.5: Works with newly-defined processes (dots) occur against the backdrop of 
a larger number of works that employ pre-defined processes. Newly-defined processes 
occur both in process-intensive (bottom) and process-light (top) areas of endeavor; in 
digital literature (dotted oval), digital cultural work (dashed oval), and wider areas 
employing digital computation (figure as a whole); as well as in digital media (right) 
and digitally-authored work (left, not hatched in this figure). 


these processes are carried out by the author or the audience. One of the goals of 
this study is to clarify how comparing such processes with digital processes can be 


both informative and potentially misleading. 


Understanding the “Digital” 


Now that the area of focus for this study has been outlined, it is worth directly 
addressing a rather vexing word for studies of digital work: “digital.” 
As just established, “digital media” is this study's term for media that requires 


digital computation at the time of audience experience, while “digitally-authored 
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work” denotes projects that require digital computation in their authoring but not 
during audience experience. In a related vein, “digital literature” and “electronic 
literature” are two of the most common umbrella terms for literary work that requires 
the use of a computer (for authoring and/or the audience). In all four of these 
terms, the words “digital” and “electronic” are individual words standing in for a 
longer phrase from computer engineering, the key phrase that characterizes modern 


” Much of this section will 


computers: “stored program electronic digital computer. 
be dedicated to unpacking this phrase. 

Of course, from the perspective of computer science this phrase needs little un- 
packing. But a primary goal of this study is to reach an interdisciplinary audience, 
including interested artists, writers, and cultural critics. Computer scientists may 
wish to skip the discussion of stored program computers (pages 20-29) or may be 
interested to see how this study attempts to employ a short discussion of ideas at the 
root of their discipline for interdisciplinary purposes. 

The definitions of “digital media” and “digitally-authored work” used in this study 
bring the word requires to our attention. What does it mean that a work of literature 
requires a computer, that it requires digital computation? If I take a Virginia Woolf 
story and send it via email, does it become digital literature? Certainly, like anything 
transmitted or displayed by a computer, it invokes computational processes — and 
in this sense a computer is required. But it is also certain that those processes have 
nothing to do with Woolf*s work — it does not require them in order to be itself. So 
unless I view my act of situating Woolf”s work in an electronic context as the creation 


of a new work, one which requires digital computation in order to be itself, we can 
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safely say that this is not digital literature. We can say the same, in fact, of all work 
that uses the computer only as a distribution and display mechanism — which could 
be substituted for any other(s). An Alfred Hitchcock movie distributed as digital 
video is in the same category. There is nothing wrong with such work. It is simply 
media distributed and/or experienced using a computer, rather than digital media.?* 

On the other hand, we can see that work which explicitly includes processes 
of digital computation in its definition, such as a computer game, requires digital 
computation. The same is true of work that is explicitly designed for its surfaces to 
be experienced in a context employing digital computation, such as an email narrative. 
Certainly in many cases such work could be moved into other contexts, and perhaps 
necessary computation could be carried out by other means (e.g., by human effort, or 
specially-designed equipment). But this would then be documentation or translation 
of the work. In some cases the impact of a translation of this sort is relatively minimal. 
The audience experience of the work might remain substantially the same. But in 
other cases the work becomes quite different — as when processes once carried out 
invisibly by the work must now be understood and executed by the audience. 

We can see the broad outline of these issues boiled down to a few concise lines 


in the definition of “electronic literature” on the website for the Electronic Literature 





“Similarly, tools for authoring digital works are not digital media. This may be most obviously 
true in the case of digital tools for authoring fixed media. For example, digital imaging tools like 
Photoshop create fixed images that are often printed on paper (involving no computation at the 
time of audience experience). In this sense digital image tools like Photoshop are much like LaTeX 
(the software used for the layout of this study) and other digital tools aimed at putting words on 
paper. But I would further argue that tools such as Flash, which are largely used to create digital 
media, are not themselves primarily experienced as media. Rather, our experience of them is much 
like our experiences of tools such as Photoshop and LaTeX. In other words, 1 believe it is useful to 
distinguish between our experiences of tools and media. 
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Organization (ELO, 2005) (one of the field's most active non-profits): 


The term refers to works with important literary aspects that take ad- 
vantage of the capabilities and contexts provided by the stand-alone or 
networked computer. 


In the field, “digital literature” and “electronic literature” have roughly equivalent 
meanings, and 1 will use the two terms interchangeably, along with the abbreviation 
“elit” (which is much more common than “diglit”). 

The terms “media” and “literature” I will not define here, and instead will bow 
in deference to the long, complex, and ongoing discussions of the meanings of these 


terms within the disciplines of media studies and literary studies. 


Stored program digital computers 


I say that the words “digital” and “electronic” are being used to reference the longer 
phrase “stored program electronic digital computer.” But what does that phrase 
mean, more precisely? This is the name of a particular approach to computer ar- 
chitecture — to designing and building computers — that has been dominant since 
the 1950s. The development of this type of computer grew out of a series of progres- 
sive abstractions away from idiosyncratic computing devices, computers designed to 
solve one problem (or a small group of problems). The overall philosophy behind the 
stored program architecture approaches computers as “universal machines,” capable 
of simulating the operations of any type of calculating machine. 

The stored program architecture has enabled a series of further abstractions such 


as operating systems and “high-level” programming languages (which can be read in 
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a manner similar to natural languages). These, in turn, have made the specification 
of relatively abstract processes, rather than the manipulation of hardware-level in- 
structions, the primary work done with computers by most practitioners — including 
almost everyone creating media or literary work with computers. We can understand 
this more deeply if we take a brief look at each of the words in the phrase “stored 
program electronic digital computer.” Doing so will help us see why it is not the 
lower levels of this tower of abstractions (such as those that store, manipulate, and 
transmit binary numbers) that deserve our primary attention as we seek to interpret 


the operations of digital systems. 


Computer 


In 1937 everyone who used the term “computer” knew what it meant. A computer 


was a person who calculated answers to mathematical questions.? These computers 





5To be precise, “computer” was a job title, describing a position often held by women. As 
N. Katherine Hayles writes in the prologue to My Mother Was a Computer (Hayles, 2005): “in 
the 1930s and 1940s, people who were employed to do calculations — and it was predominantly 
women who performed this clerical labor — were called “computers.. Anne Balsamo references this 
terminology when she begins one of the chapters in her book Technologies of the Gendered Body with 
the line I have appropriated for my title: “My mother was a computer.? Balsamo's mother actually 
did work as a computer, and she uses this bit of family history to launch a meditation on the gender 
implications of information technologies.” In the 1950s, an answer sometimes given to those who 
feared the growing role of mechanical computers in the workplace was that it would mostly only 
replace women: human computers and clerical workers. Christopher Strachey, whose work will form 
the focus of the next chapter, wrote (1954), “I do not think that the introduction of computers into 
offices in this way will produce any very marked disturbances. It is true that a very large number 
of people will ultimately be replaced by these machines — possibly as many as a million in the 
next ten years — but the process will be a gradual one. Perhaps more important is the fact that 
there is, in any case, a very large turnover in the clerks they will replace. The majority of these are 
young women who leave after a few years to get married, so that the introduction of a computer 
will probably not throw very many people out of work. It will merely stop the intake of new clerks, 
who will presumably have to seek other occupations. What these will be, is an interesting matter 
for speculation.” (p. 31) 
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were not expected to develop new, creative methods to prove outstanding mathemat- 
ical problems. Rather, they were expected to follow a known and specified set of 
instructions which, together, formed an effective procedure for answering a particu- 
lar kind of question. We call such sets of instructions algorithms (from the name of 
Arabian mathematician al-Khwarizmi). 

But in 1937, when Alan Turing was a young researcher at Cambridge, the world 
was quietly introduced to a thought experiment that lay important groundwork for 
the kinds of non-human computers we have today. This introduction was provided by 
the publication of Turing's paper “On Computable Numbers” (1936) which outlined 
what we now call a “universal Turing machine.” Turing's paper was not remarkable 
for imagining a machine that could carry out the work of human computers. In the 
1930s there were already in operation a number of such machines (including Vannevar 
Busb's Differential Analyzer) and at least 100 years earlier (by 1837) Charles Babbage 
had conceived of an Analytical Engine, capable of mechanizing any mathematical 
operation.* Two things, however, separated Turing machines from all calculating 
machines in operation in the 1930s (and most of the 1940s) as well as all previous 
thought experiments (including Babbage's). 

First, the Turing machine was developed in response to a mathematical question 
(posed by Hilbert) as to whether mathematics was decidable. That is, was there 


a method that could be applied to any assertion that would correctly determine 





SThe design of the Analytical Engine called for it to be programmed via punched cards such as 
those used for automated looms, making it possible for Babbage's collaborator Ada Lovelace to be 
called by some the first programmer of a universal computer, even though the Analytical Engine 
was never constructed. 


23 


whether that assertion was true? The Turing machine was a formalization that made 
it possible to discuss what could and could not be calculated — answering Hilbert's 
question in the negative, and establishing one of the primary foundations for computer 
science as a science (the investigation of what can and can't be computed). 

Second, the imagined design of the Turing machine was in terms of a potentially 
implementable (if inefficient) mechanism. This mechanism was such that it could not 
only logically branch while following its instructions (doing one thing or another based 
on results to that point) but also act as a universal machine (simulating the activities 
of any other calculating machine). Turing demonstrated an important aspect of this 
universality in his paper. His proof that some numbers can't be calculated rested 
on what is called the “halting” problem, which asked whether a Turing machine 
could be constructed that, when given the description of another Turing machine 
as its input, could determine whether the operations of that Turing machine would 
definitely halt when calculating based on a specific input. The basic move of this 
argument requires that a universal Turing machine be able to simulate the activities 
of another Turing machine. This provides the conceptual foundation for the tower 
of progressively abstract computing machines at work in a modern stored program 


computer.” 





7A computing machine that can calculate anything calculable by a Turing machine is often called 
“Turing complete.” 
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Digital 


But before we delve further into discussion of stored program computers, let us ad- 
dress the words “electronic” and “digital.” The first of these can probably pass 
without definition — but the second is in need of clarification, especially given the 
mystifying ways in which it is sometimes used. Digital, as it turns out, is not a word 
specific to computers, despite the fact that it is the word we often latch onto in order 
to represent computers. “Digital” information, as opposed to “analog” information, 
is represented by discrete rather than continuous values. It is actually related, ac- 
cording to the Oxford English Dictionary, to the sense of fingers and numbers as 
“digits.” Each of the first nine Arabic numbers (or ten, if one includes zero) can be 
expressed with one figure, a digit, and these were originally counted on the fingers, or 
digits. Charles Babbage's Analytical Engine called for representing decimal numbers 
using ten-spoke wheels — which made it a design for a digital computer, because 
each of the ten wheel positions was discrete.9 During WWII Konrad Zuse built the 
first program-controlled digital computer that, instead of Babbage's decimal arith- 
metic, used binary arithmetic implemented in on/off electrical devices.? This was a 


considerable simplification and set the stage for advances in speed and precision that 





8In contrast, many early 20th century computers used analog, continuous representations — such 
as varying electrical currents or mechanisms that turned at varying speed. These analog computers 
could perform some tasks very quickly. For example, adding two quantities represented by electrical 
currents could be accomplished simply by allowing flow onto particular wires, rather than by actually 
establishing the two values and numerically calculating their sum. However, because of the lack of 
discrete states, analog computers were inflexible in their orders of precision and prone to noise- 
induced errors. 


“Turing machines, though they are mathematical entities rather than physical ones, preceded 
Zuse's machine in being designed as binary digital computers. 
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are important to our “digital” computers. Working independently (and very secretly) 
the British government cryptanalysis group of which Turing was part (and where he 
was instrumental in cracking the German Enigma code) created the Colossus, which 
has been characterized as the first fully functioning electronic digital computer. 

This brings us to a central point, as we evaluate previous discussions of the “dig- 
ital” from within the arts and humanities. The on/off (or one/zero) of binary arith- 
metic is often taken as synonymous with “digital.” In fact, for something to be digital 
it need only be represented by discrete values — whether binary values represented by 
electronics, decimal values represented by Babbage's ten-spoked wheels, or base-five 
values represented by chemical strands. A digital computer could be built on any of 
these systems. And the way that the computer's data is stored and processed — in 
binary electronics or base-five chemical encodings — would make no difference to us 
if implemented efficiently. Failure to understand this has led to a mysticism of the 
digital as binary, a feeling that theorists grappling with computing need to get down 
to the core and understand things in a way that proceeds from ones and zeros. 

Jean Baudrillard is perhaps the the theorist most commonly invoked at the outset 
of these confused forays into binary-based interpretations of culture and computing. 
And, in fact, scholars may be correct to cite Baudrillard as explaining something 
about our culture when he writes, “the true generating formula, that which englobes 
all the others, and which is somehow the stabilized form of the code, is that of binarity, 
of digitality” (1983, 145). But Baudrillard is not explaining anything about computers 


with these words.*% In computers binary numbers are serving simply to encode in 





lOBaudrillard's focus, instead, is primarily on issues much larger than the operations of digital 
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discrete values any sort of information: the curve of an arc, a series of alphabetic 
letters, a mathematical formula. Yet theorists such as Sadie Plant cite Baudrillard 
in an unfortunate way, as though his writings give insight into the operations of 
computational systems, and further extrapolate in statements such as “the stark 
reductionism of binary code reinforces the binaries of the modern sexual economy.” 
On the other hand, Plant writes, later in the same paragraph, that “introduction of 
binary code introduces a plane of equivalence which undermines the very foundations 
of a world in which male and female have played the roles of superstructure and 
material base”* (1996). But such statements can only be true to the same extent that 
the 26-letter Roman alphabet carries out “stark reductionism” and yet “introduces 
a plane of equivalence” through the material it encodes, such as Plant's writing. A 
binary encoding scheme tells us little about, and has little impact on, the meaning of 


what is encoded. 





computation: the ways that we understand and operate our culture as a whole. He writes: “Thus 
we find once more in history that delirious illusion of uniting the world under the aegis of a single 
principle — that of a homogenous substance with the Jesuits of the Counter Reformation; that of the 
genetic code with the technocrats of biological science (but also linguistics as well), with Leibniz and 
his binary divinity as precursor” (p. 109-110). “The entire system of communication has passed from 
that of a syntactically complex language structure to a binary sign system of question/answer — 
of perpetual test” (p. 116-117). “From the smallest disjunctive unity (question/answer particle) up 
to the great alternating systems that control the economy, politics, world co-existence, the matrix 
does not change: it is always the 0/1, the binary scansion that is affirmed as the metastable or 
homeostatic form of the current systems” (p. 134-135). 


1Plant's preceding evocation of Baudrillard reads: “Messengers, messages, and the points be- 
tween which they circulate are coded in the 0 and 1 of binary maths, an off/on switch which is, as 
Baudrillard writes, 'no longer a distinctive opposition or established difference. It is a “bit”, the 
smallest unit of electronic impulse — no longer a unit of meaning |...] This is what the matrix of 
information and communication is like, and how the networks function.” ” The ellipses are in Plant's 
original. 
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Stored program 


To return to “stored program” — the remaining words in need of discussion from the 
phrase “stored program electronic digital computer” — it was only after the Second 
World War that a number of successful efforts were made toward such computers. 
Stored program computers, in a manner reminiscent of Turing's machine, store in- 
structions in the same read /write memory as the data on which they act. But they 
do it via mechanisms much more efficient than the endless tape of Turing's thought 
experiment, and one theoretically intriguing result is to allow the machine to alter 
its own instructions while in operation. It is from this type of architecture that we 
get the words “stored program” in the phrase “stored program electronic digital com- 
puter.” This lies at the heart of the computers we use each day, including the laptops, 
desktops, servers, cellphones, game consoles, interactive environment controllers, and 
other computers employed by authors and audiences of digital literature. 

Turing had a design for such a computer in mind, developed with Don Bayley, 
while the war was still being fought. In the summer of 1945 he received support to 
start work on it from the Mathematics Division of the National Physical Laboratory, 
leading to the ACE project, but progress was slow. Meanwhile, across the Atlantic, 
a team at the University of Pennsylvania completed the ENIAC just after the war's 
end. (This was believed, while the Colossus was still secret, to have been the first 
fully functioning electronic digital computer.) While work on the ENIAC was still 
underway, a 1945 report on future design plans — based on insights from ENIAC 
designers J. Presper Eckert and John Mauchly, working together with John von Neu- 


mann — was the first important document to describe the concept of stored program 
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digital computers (leading to the rather inappropriate name “von Neumann archi- 
tecture” for such systems). The following summer, an invitation-only lecture series 
spread word of the Pennsylvania team's EDVAC design to other computing research 
groups. From there stored program computer research gathered momentum in many 
contexts.?? 

The first group to build an operational machine was at Manchester University. 
Their “Baby” was completed in 1948 (it used a CRT display for its storage) which 
was followed by a more complex Manchester prototype in 1949 and then replaced by 
an industrially manufactured version, the Ferranti Mark I, in 1951.*% Turing, having 
left the frustratingly slow NPL project behind, wrote the programming manual for 
the Mark I and constructed a random number generator that produced truly random 
digits from noise. It was this machine, this programming manual, and probably this 
random number generator that Christopher Strachey used for the love letter generator 
and checkers-playing program that will be discussed in detail in the next chapter. 

So, what was it that Strachey was using? What is a stored program electronic 
digital computer? For our purposes, it is not necessary to understand all the details 
of the differing designs for computers over the half century during which experiments 
in digital media have taken place. Rather, it is important to understand that the 


fundamental model of computation has remained the same since Turing's pioneering 





12This also marked the beginning of developments that lead to digital computers that could be 
manufactured and sold by companies focused on supplying information management and calculation 
equipment to industry. This story is told in an enlightening fashion by Martin Campbell-Kelly and 
William Aspray's Computer: A History of the Information Machine (2004). 


ISOther notable efforts include the University of Cambridge EDSAC (1949), the University of 
Pennsylvania EDVAC (1951), the MIT Whirlwind I (1949), and others. 
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work. The stored program digital electronic computer has served to make this model 
practical and allowed for the increasingly abstract specification of processes. 

Digital computation, using discrete values, is a move away from continuous rep- 
resentations specific to the machine in question. Stored program computation, which 
moved the machine's instructions into the same memory as its data, is a move away 
from programming instructions by physically altering the machine (or by input de- 
vices accessed during execution, or by specialized memory stores). Both allow for 
greater speed of process execution and greater portability of processes from one ma- 
chine to another. Both allow for the construction of more efficient universal machines 
— machines that can simulate the processes of other machines. 

The result, as argued earlier, is a continuing movement away from the specifics 
and idiosyncrasies of a particular device, a particular computer, and toward the 
increasingly abstract specification of processes. This movement will carry on for 
the foreseeable future and, as it has since the early 1950s, it will likely continue to 
take place within the context of the same basic concept of the electronic digital stored 
program computer. It is on this platform for specifying and executing processes that 
we have built (through the construction of operating systems, compilers, high-level 
languages, virtual machines, and so on) increasing levels of abstraction from the 
specifics of any particular machine. And it is this that motivates my focus on the 
operations of digital processes — on what they do, rather than how they are stored 
(magnetic or optical devices), what carries out the instructions (silicon wafers), or 


any other largely irrelevant ephemera of this moment in computing. 
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Beyond Code Aesthetics 


Having delved this far into definitions of digital computation that build on its field 
of origin — computer science — one might well ask why this study does not build its 
approach to reading processes on a similar foundation. Computer science includes, 
after all, not only technical measures of process efficiency, but also a culture that takes 
an aesthetic view of processes (and especially of their artful definition in program- 
ming language code). Unfortunately, this aesthetic viewpoint is embraced largely 
by computer science's unofficial culture (rather than as part of official disciplinary 
knowledge) and inspires a type of interpretation that does not encompass what this 
study sees as the most important elements of reading processes. 

Nevertheless, the computer science approach to reading code can provide an im- 
portant complement to the interpretation of processes advocated in this study. There 
is much of value in this tradition, both in the work done from a computer science 
perspective and in the (unfortunately quite rare) work that views computer science's 
aesthetic practices from an interdisciplinary perspective. A noteworthy example of 
the latter sort of work is Maurice Black's The Art of Code (2002). In this study 
Black outlines a history of increasingly aesthetic (rather than purely mathematical 
or functional) views of computer programming, and identifies a widespread literary 
attitude toward code that developed in the first few decades of the stored program 


computer: 


[Elvolutions in computing architecture and programming methodology 
during the 1950s and 1960s allowed programmers to conceptualize writing 
code as a literary endeavor. During the 1970s, a new dimension was added 
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to programmers' sense of code as a complex art: it was then that computer 
scientists began to think of code as something that could be profitably 
read as one would read a work of literature, as something whose form 
and content are best revealed and appreciated by way of detailed line- 
by-line exegesis. Formalist analysis of code became a pedagogical norm 
during the 1970s: computer science professors began putting source code 
on their course reading lists, students of computer science began studying 
source code on their own and in reading groups, and computer science 
professionals began reading source code to improve their programming 
skills. (p. 119) 


This is certainly the case, though not a complete picture. The second half of the 
above paragraph, which describes reading code as a widespread pedagogical practice, 
is indisputably correct. But it leaves us with questions. What kinds of readings are 
practiced, and toward what ends? 

Before we get to those questions, an earlier passage in the quotation above requires 
a bit more unpacking. When we read of code being seen as “something that could be 
profitably read as one would read a work of literature” Black does not simply mean 
that it can be fruitfully read line-by-line. He also means to point toward a perspective, 
held by many prominent members of the computer science community, that code 
can be profitably read like a work of literature because well-written code is, in some 
sense, essentially literary. In establishing this, Black quotes figures like Donald Knuth 
(widely respected author of The Art of Computer Programming) who has stated 
“Programming is best regarded as the process of creating works of literature, which 
are meant to be read” (1992, ix). 

In pointing toward this, Black is describing the culture of computer science, espe- 


cially of programmers, which is not quite the same as the discipline. The discipline of 
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computer science, in both the university and business, has tended to read processes 
largely in terms of issues like computability and, especially, eficiency — efficiency of 
execution (speed), efficiency of production (writing), efficiency of maintenance (con- 
tinual rewriting), or other efficiencies. The CS discipline may laud the practices of 
figures like Knuth, who advocates “literate programming.” But practices such as 
literate programming are largely supported because they produce well-documented 
code which is more efficiently de-bugged and maintained by groups of programmers 


over time.** 





MTn fact, even in the act of presenting Knuth with the Turing award (the Nobel of computer 
science) disciplinary CS could not bring itself to officially recognize his insistence that programming 
be viewed as an artform. Instead, the ACM couched his contributions in technical and pedagogical 
terms, with the award officially granted “for a number of major contributions to the analysis of 
algorithms and the design of programming languages, and in particular for his most significant 
contributions to the “art of computer programming' through his series of well-known books. The 
collections of techniques, algorithms and relevant theory in these books have served as a focal point 
for developing curricula and as an organizing influence on computer science” (Knuth, 1974, 667). 
Knuth, for his part, pointedly named his acceptance lecture “Computer Programming as an Art” 
and began it in this way: 


When Communications of the ACM began publication in 1959, the members of ACM's 
Editorial Board made the following remark as they described the purposes of ACM's 
periodicals: “If computer programming is to become an important part of computer 
research and development, a transition of programming from an art to a disciplined 
science must be effected.” Such a goal has been a continually recurring theme during 
the ensuing years; for example, we read in 1970 of the “first steps toward transforming 
the art of programming into a science.” Meanwhile we have actually succeeded in 
making our discipline a science, and in a remarkably simple way: merely by deciding 
to call it “computer science.” 


At the same time, it is worth noting that Knuth has presented a pragmatically-grounded case for 
the practices he advocates. In the same preface in which he writes “Programming is best regarded 
as the process of creating works of literature” he also states: 


Literature of the program genre is performable by machines, but that is not its main 
purpose. The computer programs that are truly beautiful, useful, and profitable must 
be readable by people. So we ought to address them to people, not to machines. All 
of the major problems associated with computer programming — issues of reliabil- 
ity, portability, learnability, maintainability, and efficiency — are ameliorated when 
programs and their dialogs with users become more literate. (1992, ix) 
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Of course, this is probably the inevitable result of computer science's status as 
a discipline. Scientific results cannot be officially lauded because of their inherent 
beauty, but because of their correctness. Technological results cannot be officially 
lauded for their virtuosic dimensions, only for their improvements over existing ap- 
proaches. It is only the unofficial culture within a technoscientific field that can 
appreciate results for their aesthetic dimensions. 

And certainly the culture of computer science reads processes aesthetically. From 
asides delivered in lectures at the beginning of computer science education through 
hallway discussions at the most advanced specialist conferences, computer scientists 
express aesthetic appreciation for well-written code. But what sort of aesthetic ap- 
preciation is this? For what is code admired? What literary qualities does it exhibit? 

As it turns out, the literary qualities admired in unofficial programming culture 
are the same ones generally explicated to students in a pedagogical context: elegance 


and concision.** Black writes of the unofficial culture: 


Early programming aesthetics were born out of necessity — given the sig- 
nificant constraints of memory and processing power in early computers, 
programmers needed to make programs perform complex tasks with as 
few instructions as possible. Writing elegant, compact code was quickly 
elevated to the status of an artform. (p. 100) 


Later, writing of the pedagogical movement that began with readings of the Unix 





l5Though, on an even more unofficial level, all kinds of cleverness are valued — including re- 
markable clarity of the sort advocated by Knuth, but also various types of creative obfuscation, as 
well as a vast number of other practices. However, these less prominent aesthetics are not within 
Black's area of focus. For a treatment of some of them see Michael Mateas and Nick Montfort's “A 
Box, Darkly: Obfuscation, Weird Languages, and Code Aesthetics” (Mateas and Montfort, 2005). 
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operating system, Black writes: “to see a line of code as beautiful, one must be able 
to see why, of all the ways the line could have been written, this way is the most 
compact, the most effective, and hence the most elegant” (p. 127). Furthering an 
explicit comparison to the close reading techniques of 20th Century “New Criticism,” 
Black goes on to say that, in this pedagogical tradition, “A program's merit is assessed 
much the way a poem's would be, in terms of structure, elegance, and formal unity” 
(p. 131). 

These are interesting foci of reading, particularly for those who are involved di- 
rectly in computer programming, but they entirely avoid the issues I hope will be 
raised in attempts to learn to read processes. The style of close reading of code 
that Black describes, which 1 experienced in my own computer science education, is 
focused on what the problem is, how it will be addressed (the technical approach), 
and how that approach is expressed in code (especially when it is particularly elegant 
and/or concise). It is entirely silent on questions of why — why the problem is being 
addressed and why the processes operate as they do. To put it another way, it is 
silent on how we might interpret the existence and functions of the system, rather 
than admire the way that those functions are expressed in code. And it is for this 
reason that the computer science approach of reading code in a literary manner is a 


potential complement to future attempts to read processes, but not a model for it. 
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Accessing Processes 


In order to interpret processes, of course, we must have access to them. Getting 
access is not always easy, and in some cases the results of attempted access may be 
speculative. Further, interdisciplinary investigation is required for many approaches. 
On the other hand, the human sciences have a long tradition of studying hidden, 
or partially hidden, processes.** Books such as Sigmund Freud's Interpretation of 
Dreams and Jokes and Their Relation to the Unconscious can be seen as describing 
strategies for accessing the operations of the unconscious's hidden processes. We 
might make similar observations about the works of Karl Marx (hidden economic 
and ideological processes), Noam Chomsky (hidden linguistic processes), and many 
others. 

Our task, of seeking access to the processes of digital works, is much simpler than 
those undertaken by Freud, Marx, and Chomsky. We do not have to attempt to 
determine the nature of the processes at work — their nature is known, as it is de- 
termined by the basic architecture of the stored program electronic digital computer. 
Also, in many cases we have access to the process specifications themselves, or to 
detailed descriptions of the process designs. However, the issue should not be treated 
as trivial. Given this, a few approaches for gaining access to processes are outlined 


below. 





16T am indebted to Wendy Chun for this observation. 
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Source code 


The source code of a digital work has a dual role. On one hand, it is a description of 
the mechanisms of the work. On the other hand, it is the closest thing to the actual 
mechanisms that most authors of digital works encounter. The code is automatically 
compiled or interpreted into a form enacted by the computer. Any error in the code 
will become an error in the operations. This ensures a certain fidelity between the 
code's description of a process and the actual operations of the process. 

On the other hand, there are important reasons that we should not depend on 
source code for our primary route to information about the operations of the pro- 
cesses of digital works. First, source code is available for only a small proportion of 
digital works. Second, source code is notoriously difficult to interpret — presenting a 
significant challenge, often, even to the individual or group responsible for writing if. 
Third, engaging with the specifics of the source code, while it allows us insight into 
issues such as individual programming style, is not always an important investment 
for understanding a work's processes.” 

For an example of these issues, let us look at the first process discussed in detail 
in this study: Christopher Strachey's 1952 love letter generator, which forms the 
focus of the next chapter. While the source code for many works of the early stored- 


program era is lost, the code for Strachey?s generator was preserved in his papers at 





17This is as opposed to literary language, for which nuances of many types are important: wording, 
sentence construction, paragraph organization, and so on. For interpreting processes, code is an 
important potential route to understanding the algorithms at work, but in most cases we care little 
about the vagaries of code indenting, comment formatting, variable naming, and so on. However, 
certain processes can be quite difficult to abstract from their code, and in these cases code may be 
the most legible representation of the processes. 
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the Bodleian Library, Oxford University. David Durand confirmed this by visiting 
the library early in the development of this study, and then I requested a complete 
copy of the folder in question from the Oxford University Library Services. Here I 


transcribe part of the code used in producing the beginning of a letter: 


A IAS a 
AED///"RRAD/GNIL///"NOH//"YEWEJ// 


LEVOL///"ECUD///"KPOM/"TEPEWS/EHT 
E"TRA///"///"YM/UOY/ERA////""YBQUO 
Y///"R"YFQUOY/ 





IvI= 


From the photocopied pages of Strachey”s papers, it appears that even he needed 
to put in marks at the four-character boundaries of memory in order to make this 
code legible. And with such marks it is still a challenge to read the words that 
run, backwards, in the clumps above: “darling,” “honey,” “jewel,” “love,” “duck,” 
“moppet,” and “sweetheart.” Further, this sort of reading only serves to unearth 
the data — the lists of words — included in the code. To actually understand the 
logical operations is another matter entirely, requiring (if one is to be certain of 
one's interpretation) a relatively deep understanding of the specific computational 
environment that Strachey was using. In other words, the path from examining 
source code to acquiring a knowledge of process — from source code to the starting 
point for being able to interpret processes — can be long and winding indeed. 

Most modern source code is not quite as large a challenge to read, line by line, 
because Strachey was working before the development of the high-level, relatively 
abstract programming languages used for much digital authoring today. But the code 


for Strachey's generator is only a few pages long, while the code for even a simple 
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computer games, written in a high-level language, would be accounted in thousands 
of lines. For each line its function and relation to the larger body of code would need 
to be understood. Even with practices such as object-oriented design and extensive 
comments embedded within the code, this is a challenge continually discussed among 
software engineers. 

Luckily, there is an alternative to reading code. Let us return to the case of 
Strachey's code. The next chapter discusses an article Strachey published, in which 
he describes the love letter generator's processes in some detail. Strachey's source 
code, once we have access to this more legible process description, takes on a less 
than central set of roles — confirming that the generator was actually built, verifying 
Strachey*s report of its operations, providing a complete set of the work's data ele- 
ments, and perhaps providing the opportunity to study the “signature” of Strachey's 
particular coding style for any who are interested. All of these are potentially helpful 


or interesting, but none are essential to the interpretation of processes. 


Process descriptions 


Strachey is far from alone in having written relatively detailed descriptions of the op- 
erations of his processes in a form other than code. In fact, the discipline of computer 
science, which was being born at the time of Strachey?'s work, makes it a requirement 
for nearly all within the field. This is because the work of most computer scientists 
is focused on the development of innovative processes. In order to receive credit for 
the development of the ideas expressed in these processes, computer scientists pub- 


lish detailed descriptions in conference proceedings, journal and magazine articles, 
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doctoral dissertations, popular and technical books, and so on. In researching the 
processes examined in this study, these have been my primary sources. They are more 
commonly available than source code, clearer to read than source code, and (often 
revealingly) combine process-oriented commentary with a focus on the elements of 
the processes in question that are considered most interesting by their authors. 
While publishing process descriptions of the sort described above is required for 
academic computer scientists, it is also practiced by those outside the academy. For 
example, later in this study 1 briefly discuss the AI employed in the computer game 
F.E.A.R. My knowledge of the game's workings comes from descriptions written by 
the author of the processes, who wrote a detailed paper about the system, presented 
the paper with additional material at the 2006 Game Developer's Conference, and 
then posted both the paper and his presentation slides on his website, in addition 
to discussing them further with visitors to his blog. The details of another game 
discussed in this study, Fable, were never publicly described in the detail employed 
in this study. However, I was able to acquire more detailed descriptions through 
personal correspondence with one of the game's developers. Neither paper presenta- 
tion nor personal correspondence with researchers are required by the field of game 
development, but both, nonetheless, are not uncommon.'* 
And there are a variety of other ways that one may gain access to useful descrip- 


tions of the processes of digital systems. For example, an innovative process for which 





18The main point here is that direct communication from and with developers can be a good 
source for information about process operations — rather than any potential ethnographic value 
to understanding developer perspectives on processes. Such an ethnographic approach might be 
interesting, but it is beyond the scope of this study. 
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an individual or group seeks “intellectual property” protection may well be patented. 
The patent process requires detailed descriptions of the processes for which protection 
is requested, and these descriptions become part of public record. Another example 
is the detailed design documents produced in the course of most large software devel- 
opment projects, which are now commonly preserved by archivists (usually employed 
by the group in question, be it a company, government agency, or something else) and 
which may be made available to researchers. Another example is the in-depth project 
proposals and final reports drawn up by software developers for their clients. Finally, 
there is an increasing amount of detailed “making of” information available for digital 
works, and especially computer games — both packaged with products (in a manner 
analogous to DVD “extras”) and on company or publication websites. While I did 
not draw on any information from patent applications, design documents, proposals, 
or reports while authoring this study, I did employ some “making of” information. 
If TI had chosen a different group of processes for my area of focus, others of these 


sources of process descriptions might well have been invaluable. 


Disassembly and decompilation 


Some processes, however, can be viewed neither through the window of source code 
nor through the window of process descriptions. This is most common when the 
processes are viewed as trade secrets by their authors. In these cases, more active 
approaches to learning process operations may be called for. A common term for 
these approaches is “reverse engineering.” 


Reverse engineering is often used by groups of software or hardware engineers 
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who want to understand the specifics of another group's product. The desire may 
be to understand the product's internal operations in order to see if they infringe 
any patents. The desire may be to understand the file formats or communication 
protocols used by a product, so that a compatible product can be built. The desire 
may be to see if the security systems employed are vulnerable to attack, so that those 
vulnerabilities can be either fixed or exploited. The desire may be to identify the 
source of a known problem with the product so that it can be addressed by someone 
other the original developer (e.g., someone fixing software originally produced by a 
company that no longer exists). There are many other potential motivations for 
reverse engineering, but all are focused on learning how a hidden element of a system 
is structured or operates. Given that understanding hidden systems is also of interest 
to academic researchers and computer hobbyists, these groups have also been active 
in reverse engineering. 

Another way of looking at the reverse engineering of software is as movement 
“up” through the layers of abstraction that define the stored program computer 


% The lowest level in this model 


and traditional software development practices.* 
is compiled, binary code specific to the architecture of a particular computer. The 
highest level is the overall design goals of the software development process. There are 
many levels in between, including the byte code used by virtual machines, high-level 


language source code, detailed listings of system behaviors, and so on. 


Moving between levels requires different types of effort, and a focus on particular 





19T am inspired, here, by Elliot Chikofsky and James Cross, who have proposed a taxonomy of 
reverse engineering and design recovery along these lines (1990). 
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levels may be more or less appropriate depending on one's goals. Specifically, the 
techniques of “disassembly” help one read the low-level assembly code of a piece of 
software when one only has access to the executable form. While often successful, 
this reveals a difficult to interpret mix of operations and data — though one format- 
ted more legibly than the code of Christopher Strachey's above. Á more ambitious 
approach, one currently successful in fewer circumstances, is that of “decompilation.” 
This attempts to reveal source code in a higher-level language from lower-level sources. 
It works better when beginning with sources that are already at a relatively high level 
(such as virtual machine byte code) than it does for machine-specific compiled code. 

Approaches such as these attempt to find a way to look inside a piece of software 
to determine its operations, and in a best-case scenario offer results that still present 
the challenges inherent in reading source code. Given this, they are not appropriate 
for those with only a casual interest in a process. That said, we often have a much 
greater than casual interest in those things we choose to study, and the route from 
disassembly or decomiplation through code analysis to an understanding of processes 
is certainly no steeper than that required to access and attempt to interpret, say, 
ancient Babylonian artifacts. 

On a more general level, it is worth noting that reverse engineering techniques have 
made possible many widely-used technologies. For example, they were employed in 
the creation of the “clean room” BIOS that enabled the development of computers 
compatible with the IBM PC — that is to say, most of the personal computer industry. 
Unfortunately, the space in which one may practice reverse engineering legally is being 


narrowed and distorted by legislation such as the U.S.'s Digital Millennium Copyright 
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Act. While these techniques are the only ways for us to study some processes, the very 
act of study may soon become illegal for processes packaged in certain “protected” 


manners. 


“Black box” analysis 


In contrast to disassembly and decompilation, other approaches to reverse engineer- 
ing achieve their results entirely by actions possible on the outside of the software. 
Because these do not attempt to look inside the software, but only observe external 
behavior, they are often referred to as “black box” analysis techniques. Black box 
techniques are used by software developers, hobbyists, academics, and others. They 
are central, for example, to the ongoing development of Samba, free software that al- 
lows computers running non-Microsoft operating systems to participate in Microsoft 
networks. 

Andrew Tridgell's (2003) outline of the four methods used when writing Samba 


is an enlightening look at the specifics of a particular black box approach: 


The first method is close examination of specifications published by Microsoft. While 


» 


“incomplete and in places rather inaccurate,” studying them reveals important 


basic facts. 


The second method Tridgell refers to as the “French Cafe” technique. A packet 
sniffer observes the transactions between Microsoft servers and clients, and the 
record of these transactions is used to learn the vocabulary employed — just 


as someone who does not speak French might learn words such as “coffee” and 
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“bread” by watching a waiter interact with customers in a French cafe. Once 
the basic structure of some commands is known, one can try other behaviors 


(e.g., accessing a file that doesn't exist) and see the messages that result. 


Method three becomes more systematic. Á program is written that tries all possible 
messages that could be commands (e.g., for 16-bit commands, 64,000 attempts) 
and notes any responses that differ from “not implemented.” For each of these, 
it then tries sending different amounts of blank data along with the command, 
until the message changes. Then different non-blank data of the appropriate size 
is sent until no error results (that is, until the command works). Finally, detailed 


comparisons of server contents reveal what the command actually accomplishes. 


Method four finds the interactions between commands. Large numbers of random 
sequences of commands are performed with Microsoft software and Samba soft- 
ware, looking carefully for differences in resulting behavior. If any are found, 
the minimal sequence that produces the difference is identified and used to guide 


revision of Samba. 


As we can see from the discussion above, in this type of reverse engineering it 
is not important how the software actually operates internally, only that it is pos- 
sible to understand how the software behaves under all relevant circumstances (so 
that a specification can be created for software that will produce exactly the same 
behavior). In a sense, this sort of reverse engineering can only be aimed at making it 
possible to replace or communicate with a piece of software — as with IBM's BIOS 


or Microsoft's networking software — rather than enabling one to contemplate its op- 
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erations. Nevertheless, achieving the level of understanding necessary for such work 
does produce detailed information that is one of the best foundations (in the absence 
of source code or process descriptions) for speculating as to the actual operations of 


the processes in question. 


“Close” interaction 


A different kind of “black box” approach is currently the most common among those 
interpreting the processes of digital works. This approach attempts to view the work's 
processes through close analysis of patterns of behavior that emerge from interactions 
with the work's surface. The potential success of this approach depends greatly on 
the nature of the work being considered. 

In the case of computer games, for example, one of the primary design challenges 
lies in helping the audience understand certain processes — the processes that de- 
termine gameplay. Only once an audience member understands these processes is 
it possible for her to know that her actions can effect these processes in particular 
ways, and then decide to take actions that seem appropriate for achieving a result 
she has chosen. This is the fundamental pleasure of gameplay and other interactive 
experiences that Janet Murray refers to as “agency” (1997) and Doug Church calls 
“intention” (1999). 

Will Wright, designer of games such as SimCity and The Sims, points out that 
players often expect to be able to deduce the processes of a computer game through 


interacting with it, without support from any outside material. He writes: 
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Just watch a kid with a new videogame. The last thing they do is read the 
manual. Instead, they pick up the controller and start mashing buttons 
to see what happens. This isn't a random process; it's the essence of the 
scientific method. Through trial and error, players build a model of the 
underlying game based on empirical evidence collected through play. As 
the players refine this model, they begin to master the game world. It's 
a rapid cycle of hypothesis, experiment, and analysis. (2006, 111) 


Bringing the audience to understand the processes supporting gameplay through 
experimentation was relatively simple in early video games. This is because the 
elements of the system (what 1 will, in a later chapter, develop more fully as the 
“operational logics” of such systems) were quite simple. A game like Pong is founded 
on the elements “collision detection” and “movement” — the minimally-represented 
ball bounces when it collides with things, and the player can control the movement of 
an minimally-represented paddle or racquet. The player can easily grasp that moving 
the paddle into the path of the ball will cause it to bounce back the other direction, 
form the intention for this to happen, and then act on this intention. 

For gameplay supported by more complex processes the design challenge is much 
greater. Wright's games are classic examples of this challenge. SimCity and The Sims 
are based on complex simulation models and, as Wright puts it, “As a player, a lot 
of what you're trying to do is reverse engineer the simulation.... The more accurately 
you can model that simulation in your head, the better your strategies are going to be 
going forward” (Pearce, 2002). But the processes are much too complex for the player 
to deduce immediately through experiment. The designer must carefully introduce 
the player to the different elements of the situation, and help the player build up an 


appropriate mental model. Wright continues: 
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So what we're trying to as designers is build up these mental models in the 
player.... You've got this elaborate system with thousands of variables, 
and you can't just dump it on the user or else they're totally lost. So we 
usually try to think in terms of, what's a simpler metaphor that somebody 
can approach this with? What's the simplest mental model that you can 
walk up to one of these games and start playing it, and at least understand 
the basics? Now it might be the wrong model, but it still has to bootstrap 
into your learning process. 


Wright's genius is in finding ways to teach his audience to understand complex 
processes through interactions that are enjoyable and that incrementally open oppor- 
tunities for meaningful intention relative the larger system. It is no small challenge. 
And this explains, in part, why his games are much less often imitated than those 
supported by processes that are easier to make interactively learnable. 

Given the great effort expended by game designers such as Wright to make their 
processes understandable through interaction, it makes sense to use interaction as a 
route to understanding these processes. In fact, this serves a dual purpose, both giving 
us insight into the work's processes and giving us greater experience of the audience's 
possible engagements. lan Bogost, responding to Sherry Turkle's suggestion that 
the processes of SimCity should be made more transparently visible to the audience, 
points to the strengths of understanding processes through interactions (and the 


pitfalls of depending on source code): 


“Opening the box,” in Turkle's opinion, would allow players to see how 
the simulation runs, providing better ability to critique. 


The problem with this objection is that the player can see how the sim- 
ulation runs: it is, in no trivial way, what it means to play the game. 
Turkle's real beef is not with SimCity, but with the players: they do not 
know how to play the game critically. Understanding the simulation at the 
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level of code does not necessarily solve this problem. Even understanding 
the simulation via some intermediary system poised between the code and 
the existing interface — some have proposed “policy knobs” that could 
alter the simulation rules of a game like SimCity — does not guarantee 
that players will understand the notion of making and interacting with 
arguments as processes rather than words. Rather than addressing this 
problem from the bottom-up through code literacy, we need to address it 
from the top-down through procedural literacy... Part of that process is 
learning to read processes as a critic. This means playing a videogame or 
interacting with another procedural system with an eye toward identifying 
and interpreting the rules that drive that system. Such activity is akin to 
the literary critic interpreting a novel or the film critic reviewing a film — 
demanding access to a computer program's code might be akin to asking 
for direct access to an author or filmmaker's expressive intentions. 


Bogost is, of course, correct that much can be learned about the processes of 
games through interaction, and that the code for games is often unavailable. On 
the other hand, not all game designers share Wright's passion for making processes 
legible to players, and not all designers are as successful as Wright in accomplishing 
the task. Further, the understandings of processes that we achieve through normal 
audience interaction are somewhat intuitive and speculative. Experimentation with 
a rigor that goes beyond normal play will often be required. We can consider such 
rigorous examination through and of gameplay a subset of what Espen Aarseth has 
called “playing analysis,” as opposed to “non-playing analysis” (2003). We might call 
it “close play” — in an analogy with the literary practice of “close reading.” 

Bogost's book provides a number of examples of analysis supported by this more 
rigorous experimental interaction with games. The exhaustive procedures of game 
industry “playtesting” provide another example. Further examples come from the 


work of a variety of scholars and developers. Kurt Squire, for example, brings such 
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an approach to the rarely-studied genre of fighting games. In his essay “Educating the 
Fighter” (2005) Squire discusses the process of defeating the character Hulk Davidson 
in the fighting game Viewtiful Joe. Squire begins by making observations reminiscent 


of Wright's: 


The process of “beating” Hulk Davidson is largely one of learning to 
read what is important in the game space. To do this, the player must 
understand Hulk's moves, understand Viewtiful Joe?*s potential actions, 
how they interact with the problem, and then realign his goals accordingly 
on the fly. Essentially, this is a dual space search problem, similar to 
hypothesis testing in science. 


But Squire does not stop here. Instead, he presents a detailed table of game 
states, events, and consequences possible during the battle with Hulk Davidson. In 
such careful, and presumably exhaustive, approaches we see a method emerging that 
may remind one of the black box analysis techniques described earlier — perhaps 
somewhere on the border of Tridgell's “French Cafe” and “all possible commands.” 
The difference, however, is in the critical goals animating work such as Bogost's and 
Squire's. The goal is not to enumerate the possible states in order to build software 
that will reproduce them, but as the first step in critical analysis of the game in 
question. 

Of course, many digital works are not games, and for this reason we may wish 
to replace the term “close play” with something more general — such as “close in- 
teraction.” But this move points to a limitation of close interaction as a method for 
accessing processes generally. In addition to the fact that only some game processes 


are legible via interaction, there is the fact that many digital works are not interactive 
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at all. Further, as we will see in the example of Tale-Spin later in this study, even 
for many interactive works a comprehensive black box analysis of the sort performed 
by the Samba team would not provide the necessary information for understanding 
the work's processes. 

Given this, certain conclusions present themselves. One is that close interaction is 
an important tool for understanding the vast possibilities for audience experience of 
processes. Another is that, for study of processes themselves, close interaction should 
be (if possible) practiced in concert with other methods of access. 

In practice, as it turns out, these conclusions are generally heeded. While Bogost, 
in Unit Operations (2006), grounds his discussion of SimCity in the results of close 
interaction, he also draws on Wright's descriptions of its processes. Similarly, while 
I have drawn on process descriptions and source code for much of the information 
about processes in this study, interaction has informed my thinking in important 


ways — for those works available in interactive form. 


An Overview of this Study 


This dissertation consists of eight chapters, including this one. This chapter intro- 
duced a focus on the processes of digital cultural products and, especially, digital 
literature. It also presented a wider model of digital media within which this focus 
on processes can be understood — including the elements: author(s), data, process, 
surface, interaction, outside processes and data sources, and audience(s). Next the 


full phrase from computer engineering to which terms such as “digital” and “elec- 
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tronic” refer, in phrases such as “electronic literature,” was unpacked: stored program 
electronic digital computer. After this, it was explained how the computer science 
model of reading code could serve as a complement to, but not a substitute for, the 
practice of reading processes advocated by this study. Finally, the pages preceding 
this one provided an overview of methods for accessing processes. 

Chapter 2 performs this study”s first examination of a particular process, engaging 
the earliest known work of digital literature (and likely the earliest example of digital 
art of any kind): Christopher Strachey's love letter generator for the Manchester 
Mark I. This examination is supported by comparison with other processes, including 
Raymond Queneau's One Hundred Thousand Billion Poems, and a consideration of 
the context surrounding Strachey's work. The discussion is also supported by an 
outline of the wider field of digital literature that has grown up in the more than five 
decades since Strachey?s pioneering work, in the course of which Chris Crawford's 
notion of “process intensity” is discussed in further depth. 

Chapter 3 brings to our discussion three processes with important indeterminate 
aspects: Tristan Tzara's newspaper poem, André Breton and Philippe Soupault's 
automatic writing, and Claude Shannon's n-gram text generation. Through these we 
begin to appreciate the importance of data and context in interpreting processes for 
which indeterminacy is primary. Also, through these and through a consideration 
of Espen Aarseth's influential Cybertext (1997) we outline some points of similarity 
and points of distinction between processes carried out by authors, audiences, and 
automatic computation defined as part of works. 


Chapter 4 briefly outlines the way that Shannon's n-gram text generation came 
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to personal computers and moved into literary use. Through this we begin to under- 
stand the complex composites that make up the authorship of many processes, and 
introduce the concepts and vocabulary of abstract processes, implemented processes, 
and operational logics. The last of these is then clarified through an examination 
of the non-player character behavior in the computer role-playing game Fable. The 
chapter next moves on to a consideration of the work of the Oulipo — a group of 
writers and mathematicians — particularly their transformational procedures. The 
chapter's final concept, which overlaps with the Oulipian concept of “potential liter- 


ature,” 


is then introduced: works of process. 

Chapter 5 is a detailed examination of the most widely-discussed story generation 
system, James Meehan's Tale-Spin, and its accompanying natural language genera- 
tion system, Mumble. Connected with this, the chapter looks at the possible worlds 
theory of fiction (which has been adopted by a number of digital media theorists) 
and current practices in the wider field of Natural Language Generation. As it turns 
out, Tale-Spin's most interesting activities, from a fictional point of view, are not 
represented in its Mumble output. They do, however — even if only visible from an 
“interior” view of Tale-Spin — provide a thought-provoking example of the Artificial 
Intelligence simulation of human behavior. 

Chapter 6 looks at post-Tale-Spin techniques in story generation, natural lan- 
guage generation, and Al more generally. Two story generation systems — Minstrel 
and Universe — were important successors to Tale-Spin that attempted to operate 
at a different level. While Tale-Spin had focused on the plan-directed simulation of 


character behavior, both Minstrel and Universe focused on the goals and plans of 
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authors. The mixed results achieved by these efforts are then contrasted with the 
movement toward statistical AI, including statistical approaches to natural language 
generation, and especially those that build on n-gram models. Finally, dreams of 
machine authorship are considered within the more general Al tendency to anthro- 
pomorphize processes. 

Chapter 7 concludes this study's consideration of the relationship between au- 
thorship and processes. The operations of two further Artificial Intelligence story 
generation systems — Brutus and Terminal Time — are considered. These sys- 
tems are described by their authors using rhetoric that ranges from attempting to 
“challenge” the creativity of human authors to locating authorship (and creativity) 
entirely outside the system. Terminal Times audience interactions inspire reaching 
back to a previous chapter so that the interactions of Tale-Spin can be considered, 
and Michael Mateas's Expressive Al perspective is introduced (with a focus on his 
concepts of “authorial” and “interpretive” affordances). While Mateas's formulations 
are helpful, they move beyond Al toward hypermedia and human-computer interac- 
tion, pointing us toward this study's broader concept of expressive processing. 

Finally, Chapter 8, after a brief overview of this study's conclusions, turns our 
attention to the elements of the model of digital media that this study backgrounded. 
The larger model is revisited from the more developed perspective on processes de- 
veloped over the course of this study. After this, two particular areas are considered 
in more depth: first, interaction and second, surface and audience. Within the dis- 
cussion of interaction a set of distinctions between types of digital media that involve 


different forms and roles of computation is presented. The chapter concludes with a 


few final thoughts about authoring processes. 
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Surface, Data, and Process 


Reading Digital Works 


As discussed in the previous chapter, a defining characteristic of digital work generally 
— and digital art and literature specifically — is that it never consists only of what is 
legible on its surface. Its surface is always, at least in part, produced by computational 
operations. These operations are determined by the processes defined by the work (in 
connection with the computer on which they are carried out) and employ the work's 
data (e.g., text, sounds, images). 

The reason for this is nearly tautological: computers are machines for carrying out 
processes. No literary surface can be produced using a computer without invoking 
processes. The fact of this necessity, however, also points toward an exciting oppot- 
tunity: the processes carried out can go far beyond (in steps, interdependencies, data 
involved, etc.) those possible for any reader to carry out on her own, any author 
to perform by hand, or any collection of analog devices to mechanically accomplish. 
Such processes, especially when specifically designed for the work, can be integral to 


what and how the work means. 
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The major thrust of this study is that to learn to read digital works we must begin 
to interpret not only their surfaces, but also their data and processes. This chapter 
contributes to the overall project with a combination of overview and example. The 
overview sections will look at the three primary groups creating literary work in dig- 
ital media: writers, computer scientists, and game designers. This chapter's example 
is the first known literary experiment with a modern computer: Christopher Stra- 
chey's love letter generator for the Manchester Mark I, completed in 1952. Strachey's 
generator, as l argue we should approach it, requires not only that we read its surface, 
data, and processes — but also that we understand these in the context of his life 
and the environment in which he worked. It requires that we know something of his 


story. 


Christopher Strachey 


People must have wondered if Christopher's father would amount to anything. Born 
to one of England's prominent families, Oliver Strachey was addicted to puzzles, 
expert at chess and bridge, a lover of crosswords, a trained pianist, and apparently 
ill suited to anything that mattered. He was not a good enough pianist to play 
professionally, he took an administrative job with the East India Railway Company 
and hated it, he was unhappily married and divorced. Then, at the outset of World 
War I, he took a post as a cryptographer with the War Office Code and Cypher 
School — and came into his own. His love of puzzles, and skill at them, made him a 


gifted codebreaker. He spent the rest of his career in cryptography, and was honored 
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as a Commander of the British Empire (CBE) in 1943. 

In the mid-1940s there was reason to wonder if Christopher Strachey would ever 
share his father's experience — would ever be able to bring the special shape of his 
mind to bear on a suitable task. He had been a bright child, playing imaginary three 
dimensional tic-tac-toe on his mother's bed in the early morning, and explaining 
mathematical concepts to his nurse at five. His academic accomplishments were not 
a match for his intellect — though they did manage to get him into Cambridge 
University. At university his social and intellectual life continued to receive more 
attention than his academic performance. He graduated without much distinction in 
1939, spent World War II working as a physicist, and 1945 found him leaving that 
work to become a schoolmaster. 

There was nothing to indicate that, seven years later, Christopher Strachey would 
find himself — through happenstance, through interests slightly askew from those 
around him — one of the people to do something “first.” However, though it is not 
yet widely known, in the summer of 1952 he undertook the first experiment in digital 
literature, and perhaps created the first digital art of any kind, when he completed 


his love letter generator for the Manchester Mark I computer. 


Strachey came to this almost out of nowhere. In 1950, just two years earlier, he had 
no official connection to research communities of any sort — not mathematics, not 
engineering, and certainly not computation. He had developed an interest in modern 


computers through scattered articles, books, and radio addresses on the topic. He 
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had never seen one.! And he had not shown a particularly keen interest in creating 
literature or art. But two things in his background can be seen as setting the stage. 

First, there were the circumstances of his upbringing, which may have made it 
more likely that a playful, creative experiment would occur to him as a possible 
use for a computer. Strachey was born in 1916, five years after his father married 
Ray Costelloe — an active suffragist, and a trained mathematician and electrical 
engineer, who came from an American Quaker background. In 1919 the family moved 
to Gordon Square, where Christopher's grandparents also lived. Gordon Square was 
then the center of the Bloomsbury group of artists and intellectuals, and Christopher's 
uncle Giles Lytton Strachey — a prominent member of the group — had just shocked 
the country with the 1918 publication of Eminent Victorians (a skewering of Cardinal 
Manning, Florence Nightingale, Thomas Arnold, and General Gordon). The family's 
neighbors included Virginia and Leonard Woolf, Clive and Vanessa Bell, and John 
Maynard Keynes. 

Second, there was the happenstance of where he went to university. Strachey 
attended King's College, Cambridge, which was then quite small (about 200 under- 
egraduates). While there he met a junior research fellow named Alan Turing, who 
was, at just that time, undertaking perhaps the most fundamental work ever per- 
formed for the discipline of computer science (a discipline still some years from being 
founded) — work treated in some detail in the previous chapter. According to Stra- 


chey biographer Martin Campbell-Kelly (1985)?, it is unlikely that Strachey spoke 





Though he had used a differential analyzer, a kind of analog computing machine, when working 
with differential equations during WWIL. 


2Most of this account of Strachey's life and family is adapted from Campbell-Kelly's “Biograph- 
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with Turing about computing at King's, but he did get to know him. And as the 
Manchester Mark I computer was built, it was Turing who wrote the programming 
manual. Though officially only a teacher at Harrow School, Strachey's personal con- 
nection with Turing was enough to allow him to, in 1951, ask for and receive a copy 
of the manual. And it was this that enabled Strachey's sudden appearance in the 
world of computing. 

Strachey had first seen a modern computer earlier that year. He had been in- 
troduced to Mike Woodger of the National Physical Laboratory (NPL) by a mutual 
friend, and had spent a full January day learning about the Pilot ACE computer then 
under construction at NPL (based on a design of Turing's, see page 27). When he 
returned to school after winter break he began working on a program to make the 
ACE play checkers (or rather, as it is called in England, “draughts”). Then he learned 
of the Mark I that had just been installed at Manchester, which Woodger informed 
him had a significantly larger “store” than the ACE — making it better suited to 
Strachey's programming interests. After receiving a copy of the programming man- 
ual from Turing, Strachey visited for the first time in July, and discussed his ideas 
for a checkers-playing program with Turing. These ideas impressed Turing, and he 
suggested that the problem of making the machine simulate itself using interpretive 


trace routines would also be interesting.* Strachey, taken with this suggestion, went 





ical Note,” while the following material on Strachey, Turing, and the love letter generator also draws 
on Andrew Hodges's biography of Turing, Strachey”s 1954 article in Encounter, Strachey”s papers in 
the Bodleian Library (University of Oxford), and material from the British Broadcasting Corpora- 
tion (BBC) archives. I am indebted to David Durand and Oliver House for archival and interpretive 
work with Strachey?s papers, and for the transcript of Strachey?'s second BBC address 1 am indebted 
to Allan Jones. 


3As David Durand has observed, having a machine simulate itself, as in the problem that Turing 
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away and wrote such a program. As Campbell-Kelly writes (p. 24-25): 


The final trace program was some 1000 instructions long — by far the 
longest program that had yet been written for the machine, although 
Strachey was unaware of this. Some weeks later he visited Manchester 
for a second time to try out the program. He arrived in the evening, 
and after a “typical high-speed high-pitched” introduction from Turing, 
he was left to it. By the morning, the program was mostly working, and 
it finished with a characteristic flourish by playing the national anthem 
on the “hooter.” This was a considerable tour-de-force: an unknown 
amateur, he had got the longest program yet written for the machine 
working in a single session; his reputation was established overnight. 


The attempts to recruit Strachey began immediately, and by November Lord Hals- 
bury of the National Research and Development Corporation (NRDC) had convinced 
him to take a position as a technical officer. Strachey, of course, was still teaching 
at Harrow School — but in 1951 and 1952 he spent long sessions during his school 
breaks at Manchester, working on his checkers program and two assignments already 
given him by the NRDC. He also attended computing colloquia at Cambridge Uni- 
versity, and even gave a two-part BBC radio address about computers that spring. 
In his second BBC talk (Strachey, 1952b) he described the multi-modal interaction 
(image on a cathode ray tube, text on teleprinter) and unusual proto-personality of 


his checkers program?: 


In addition to showing a picture of the board with the men on it on a 





suggested to Strachey for his first Mark I program, is also the basic outline of Turing's demonstration 
of the halting problem (see page 23). 


“More significant than its questionable status as the first computer personality, Strachey's check- 
ers program troubles the claim of A.S. (Sandy) Douglas's OXO to the title of “first graphical com- 
puter game.” Douglas's program, which showed a game of tic-tac-toe on a CRT, was developed in 
1952 for the University of Cambridge EDSAC. 
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cathode ray tube, and to printing out the moves on a teleprinter, the 
machine makes a sort of running commentary on the game. For instance 
it starts by printing “Shall we toss for the first move? Will you spin a 
coin.” It then calls, in a random manner, and asks “Have I won?” There's 
no cheating in this, at any rate as far as the machine is concerned. The 
player has then to feed his moves into the machine according to certain 
rules. If he makes a mistake the machine will point it out and ask him to 
repeat the move. If he makes too many mistakes of this kind, the remarks 
printed by the machine will get increasingly uncomplimentary, and finally 
it will refuse to waste any more time with him. 


By June 1952 Strachey had wound up his responsibilities as a schoolmaster and 
officially began full-time computing work as an employee of the NRDC. That summer 
he developed — probably with some input from others — a Mark I program that 
created combinatory love letters.? 

It is unlikely that Strachey had digital art, of the sort we create today, in mind.* 
For one thing, there would have been little thought of an audience. As with his 


checkers-playing program, the love letter generator could be reported to a wider 





5Campbell-Kelly notes some aesthetic advice from Strachey's sister Barbara while Hodges men- 
tions collaboration with Turing, but neither source confirms the other's account on these points. 
In Strachey's writings he often fails to even credit himself (preferring to say that there is such a 
program, and leaving aside who created it). 


S At the time of Strachey's projects, when the first stored program computers were just coming 
into existence, artistic applications of computers were essentially unheard of. Jasia Reichardt, the 
prominent curator of the 1968 computer art exhibition “Cybernetic Serendipity,” wrote in 1971 that 
computer art's “first tentative steps date back to 1956” (p. 7). The earliest examples cited in current 
surveys of digital art, such as Christiane Paul's Digital Art (2003), are from more than a decade 
after Strachey”s generator. 

It is, of course, quite possible that further research will reveal even earlier digital artworks than 
Strachey's generator. For example, C. T. Funkhouser has written of a 1959 digital poem created 
by Theo Lutz using one of Zuse's electronic digital computers (forthcoming) — which may lead us 
to imagine an earlier work of digital literature/art, using one of Zuse's earlier systems, might be 
uncovered through further research. But, whatever happens, we do know that the field of digital 
literature has more than a half century of history, almost as long as that of the digital computer 
itself and perhaps the longest of any of the digital arts. 
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public, but only experienced directly by a small audience of his fellow computing 
researchers. At the same time, it certainly was not an official assignment from the 
NRDC, but rather, like many creative computing projects, undertaken for enjoyment 
and to see what could be learned. 

Not everyone in Strachey?s small audience enjoyed it equally. Turing biographer 
Andrew Hodges (2000) reports that “Those doing real men's jobs on the computer, 
concerned with optics or aerodynamics, thought this silly, but ... it greatly amused 
Alan and Christopher” (p. 478). Looking at the program's output today, we can 
understand why Turing and Strachey's colleagues thought the project silly. Here is 
an example that Strachey published in a 1954 article in the art journal Encounter 
(immediately following texts by William Faulkner and P. G. Wodehouse): 


Darling Sweetheart 


You are my avid fellow feeling. My afftection curiously clings to your 
passionate wish. My liking yearns for your heart. You are my wistful 
sympathy: my tender liking. 


Yours beautifully 
M-UsCE. 


There could be a variety of reasons why, reading this, we might not share Turing 
and Strachey's great amusement. Perhaps because we are further removed from a 
certain type of purple prose,” or from that early computing culture focused on “real 


men's jobs.” But I think another reason is more likely — that it is not simply the 





"Strachey, in his Encounter article, characterizes the generator's output as giving a “Victo- 
rian” impression (p. 26). But this seems the same view of Victorian culture found in Giles Lytton 
Strachey's Eminent Victorians, which is probably more amusing than accurate. 
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output that amuses, that the resulting letters are not really the interesting part of 
the project. 

When we read an example of output from the love letter generator, we are seeing 
the surface manifestation of two other elements that remain hidden from us: the 
generator's data and processes. It is these two elements that Strachey worked on, 
and any one of the vast number of possible output texts is only an unpredictable 
manifestation of them. It is likely that this unpredictability is part of what amused 
Strachey and Turing, but we will have only a partial understanding of it, or any other 
aspect of the system, if output texts are all we consider. Yet we are unfortunately 
likely to do so. In fact, most reports of the generator (including in the excellent texts 
of Campbell-Kelly and Hodges) provide only sample outputs. The sole exception 
to this I have found is Strachey?s own article in Encounter, which details the entire 
process. 

As the previous chapter began to argue, this is a problem for work in digital 
literature and art generally. We focus on surface output — and as a result our 
understandings are not informed by an appreciation of process and data. These are 
integral parts of computational works, and to fail to consider them means we only 
see digital literature from the audience's quite partial perspective. The fundamental 
fact about digital works is that they operate, that they are in process, and only once 
our interpretations begin to grapple with the specifics of these operations will we be 
practicing a method commensurate with our objects of study. 

That said, before further discussion of the love letter generator, and how a closer 


examination of it can inform our view of the wider fields of digital media and litera- 
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ture, it is necessary to talk a bit about the field which has developed in the fifty years 


since Strachey's first experiment. 


Writers, Computer Scientists, and Game Designers 


The previous chapter established that a computer is a machine for carrying out pro- 
cesses. But there are many potential processes — ones that follow decision flowcharts, 
ones that combine and display many types of media, ones that compile statistics, and 
so on. It is not necessarily obvious which capabilities of the computer are useful from 
a literary, or more generally artistic, perspective. 

It is rare, at this point, to ask directly, “What good is a computer for the arts?” 
Presumably this is because we feel we know the answer. But, in fact, lines of division 
within many creative communities are defined by differing presumed answers to this 
question. 

One can observe this phenomenon at ACM SIGGRAPH, the yearly meeting of 
the Association for Computing Machinery's Special Interest Group on Computer 
Graphics and Interactive Techniques. Every summer tens of thousands of people 
come to SIGGRAPH for a combined industry tradeshow, scientific conference, and 
art gallery. The differing status of images — of the products of computer graphics 
— in different parts of the convention center is striking. 

In the conference portion of SIGGRAPH, dominated by paper presentations from 
computer scientists, images play the role of examples and illustrations. Images are 


there to help explain the real results being reported — which are novel techniques, 
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algorithms, processes. In the art gallery, while there are a few talks, most of the 
presentations are of art works, and most of the works are prints hung on the wall. In 
these prints, the images are not aids to an explanation of results — they, themselves, 
are the results. This can lead to some tension, because the artists know it would 
be impolite to call the images made by the scientists nalve and uninspiring, and the 
scientists know it would be impolite to call the processes used by the artists trivial and 
uninteresting. And such tensions are not unknown in the field of digital literature; 
for example, around the literary readings held at ACM Hypertext conferences. But 
they can also take a somewhat different form. 

In trying to think through differences such as these, one potentially useful piece 
of vocabulary can be found in the writing of Chris Crawford, a noted computer game 
designer and theorist. This is the concept of “process intensity,” introduced in the 


previous chapter, which Crawford describes as follows: 


Process intensity is the degree to which a program emphasizes processes 
instead of data. All programs use a mix of process and data. Process is 
reflected in algorithms, equations, and branches. Data is reflected in data 
tables, images, sounds, and text. A process-intensive program spends a 
lot of time crunching numbers; a data-intensive program spends a lot of 
time moving bytes around. (Crawford, 1987) 


In some ways Crawford's distinction between process-intensive and data-intensive 
approaches maps nicely onto two views of how a computer might be used toward 
literary ends. When a work of digital literature emphasizes the presentation of pre- 
created words, images, and sounds, those are all data. When it emphasizes algorithms 


and calculations, those are processes. 
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On the other hand, as we will see in the examples of this section, it is not quite ac- 
curate to say that authors of data-intensive digital literature are primarily interested 
in data. Rather, it seems better to say that they are primarily interested in what the 
reader sees — they are interested in the surface, the output from the operations of 
process on data. Many writers who have moved into digital literature tend to employ 
data-intensive forms, but writers are primarily attracted to these approaches for the 
results they are able to achieve. Often these are results writers have desired from 


long before the advent of the modern computer. 


From plain text to expressive animation 


Writers over the last century have often wanted to exceed the limitations of black 
text printed on white paper and bound into a traditional codex book. For example, 
twenty years before William Faulkner?s text preceded Strachey?s in Encounter, the 
1934 Publishers? Trade List Annual carried a listing for a book it had also listed in 
1933, but which never appeared: a four-color edition of Faulkner's The Sound and 
the Fury (Meriwether, 1962). Faulkner wanted four-color printing in order to make 
the time shifts in the first section of the book easier for readers to follow. He worried 
that alternation between italic and roman type might not be sufficient, and rejected 
suggestions from the publisher that he felt would harm the flow of the language 
(e.g., space breaks). But such four-color printing would have made the book, even 
after it was a proven critical success, prohibitively expensive for many purchasers — 
especially during the 1930s U.S. economic depression. 


On a modern computer, of course, four color text has practically the same cost as 
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single-color text. For example, the additional cost of transmitting colored text over 
the web (an HTML file made slightly larger by tags indicating sections that should 
be different colors) is negligible, and the ability to display color is already present. 
Combining images with one's text consumes a bit more transmission bandwidth, and 
sounds and moving images a bit more — but for those connecting to the Internet from 
businesses, universities, or high-speed home connections the difference is barely worth 
comment. Finally, there may be some software cost — such as for a program like 
Flash or After Effects, if one wants the text itself to animate — but this is generally 
much less than the cost of the computer, and miniscule compared to large scale color 
printing on paper or film. 

This change in cost structure has opened vast possibilities for writers, especially 
as the networked computer display comes to rival paper as a surface for everyday 
reading and writing. The more we find ourselves reading and writing with a com- 
puters — email, web pages, instant messaging, word processing — the easier it is for 
writers to reach many people with literary works that once could only have reached 
a few, due to the costs of reproduction and distribution. Color printing, moving 
text, and worldwide delivery are now available to writers for essentially no additional 


expenditure.$ 





SOf course, the costs to audience members — rather than authors — are also quite different. If 
an audience member does not own an appropriate computer, there is no way to borrow or purchase 
an individual work as one would an individual book. This tends to limit the audience for digital 
works to those who already have regular access to a computer appropriate for the work (which 
may require a traditional desktop/laptop, a particular game console, a GPS-enabled cell phone, 
or another type of computer). On the other hand, once one has access to an Internet-connected 
computer, at the moment most digital work created by writers is available for free download. Less 
work is available for free download in other areas — such as the work created by game designers. 
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One area in which striking results are beginning to emerge from this is the use 
of animated text. Before the web and Flash became mainstream, most readers were 
rarely, if ever, exposed to interesting animated typography. As John Cayley argues 
in his essay “Bass Resonance,” those exposures that did occur often took place in 
the context of cinema (Cayley, 2005). He writes, “Written words first moved on film. 
Film titling, in particular, is where we must look for a self-conscious and aestheticized 
practice of dynamic typography and, indeed, of dynamic writing. This work predates 
the small body of video-based language work (Kostelanetz), the few but significant 
essays of art language practitioners (Holzer), and also, of course, what writing there 
is that exists in programmable media.” Of course, film title designers, even the 
famous Saul Bass focused on in Cayley's essay, were by definition not working with 
literary texts. While they may have accomplished interesting animations of text, 
those animations had little to do with the meanings of the texts. 

Now, however, the web and Flash — building on momentum established with the 
earlier distribution method of CD-ROM and the earlier preferred software Director — 
have brought writer-produced animated texts to a new level. Rather than the difficult 
to access work of a few scattered practitioners, animated text poetry and fiction are 
now international movements with notable communities in South and North America, 
Asia, and Europe. As John Zuern puts it in his book-length study Articulation 
Animation, “the rapid development and wide dissemination of digital text-processing 
and animation technologies in the last two decades have shifted the deployment of 
text animation into high gear” (2005). The results also, apparently, have wide appeal. 


For example, Dakota, by Korean duo Young-Hae Chang Heavy Industries (2002), 
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has found an audience ranging from visitors to the Whitney Museum to browsers 
of popular online animation forums such as Albino Blacksheep. But Dakota's wide 
appeal is only part of what makes it interesting to us. It also, as Jessica Pressman has 
pointed out (2005), has strong connections with literary modernism — a movement 
in which writers worked toward animated text long before Flash, Bass, or Holzer. 

Dakota presents short segments of capitalized black text, replacing one another 
on a white background, in time to Art Blakey”s jazz drumming. Stark text about 
driving and drinking, guns and gangbangs and Elvis, eventually accelerates into near- 
illegibility. These techniques create an expressive, rhythmic reading experience — 
which might usually be thought of in terms of animation — but Dakota's declared 
affiliations are with modernist poetry. According to its authors, Dakota “is based 
on a close reading of Ezra Pound's Cantos Í and first part of II” (Swiss, 2002). And 
certainly the parallels are there when one begins to look, though Odysseus's jout- 
ney has been replaced by a road trip through the Badlands. The language has also 
changed registers significantly — Pound's opening line “And then went down to the 
ship” becomes “Fucking waltzed out to the car.” But, as Pressman argues, the con- 
nections run beyond theme and language. In fact, the textual animation technique of 
Young-Hae Chang Heavy Industries “branslates a central principle of Pound's poetics, 
“super-position,” which he would later call the “ideogrammic method.” ” 

And while Pound's Cantos did not situate themselves within a new reading tech- 
nology in order to achieve their effect, he did provide a poem for an explicitly tech- 


nological project: Bob Brown's “Readies.”? Like a number of other writers of the 





“The trajectory connecting Pound, Brown, and YHCHI serves as the focus for Pressman's piece. 
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time, Brown believed that writing needed to move as it came into the modern age of 
speed. He wrote, in 1930, “The word “readies' suggests to me a moving type spectacle, 
reading at the speed rate of the day with the aid of a machine, a method of enjoying 
literature as up-to-date as the lively talkies” (1998, 29). But rather than advocate 
writing for the cinema, Brown spent years designing a reading machine suitable for 
bringing moving text into homes and libraries — involving small type on a movable 
tape running beneath a magnifying lens. He also published a book, in anticipation of 
the construction of the machine, titled Readies for Bob Brown's Machine, which in- 
cluded more than 200 pages of contributions from prominent authors such as Pound, 
Gertrude Stein, and William Carlos Williams. In the traditionally-printed versions 
of these texts, the authors chose to score movement with different marks — Stein, for 
example, with equals signs, and Williams with colons. While Brown's machine was 
never constructed, textual animation is now available in many homes and libraries 


on the screens of networked computers. 


From epistolary novel to e-mail in the moment 


Networked computers have also made it possible for authors to undertake other data- 
intensive experiments, including new takes on the epistolary novel. While landmark 
early novels such as Samuel Richardson's Pamela and Clarissa (from the 1740s) 
were told in the form of letters, and while correspondence between characters forms 
a major component of later novels such as Bram Stoker's Dracula (1897), there is 
something strange about the presentation of these stories. On some level they give 


the reader the sense of listening in on correspondence, of having access to a series 
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of documents produced through time — but they actually arrive in a very different 
form. Of course, it would be impossible to hand-produce a copy of each letter for each 
individual reader, and have each letter then individually delivered to each reader's 
address — the costs of duplication and delivery would be enormous — but it would 
be much closer to the epistolary spirit of the works than printing large collections of 
the correspondence in books or magazines. Some readers have attempted to address 
this by, for example, reading Clarissa according to the calendar: on each day only 
reading the letters of Richardson's novel that bear the same date. While this might 
offer a pacing more like listening in on correspondence, the disjunction between the 
book's form and that of 18th century letters remains strong. 

Of course, now, many of us write more email than paper letters. And from this has 
arisen an opportunity for writers: to allow us to listen in on fictional correspondence 
that arrives in the proper form (electronically) and at the proper pace (as though 
we are carbon-copied at the moment the messages are sent by the characters). Some 
writers have chosen to take up this opportunity in just this manner, and others have 
chosen to translate epistolary fiction into email by different means. 

Rob Wittig's Blue Company is an example of the first tendency. It provides 
readers with the messages sent by a technology worker (Bert) to the woman with 
which he is enamored — after his company transfers him not only to a distant location 
(Italy), but also to a distant time (1368) (Wittig, 2001, 2002). As the messages arrive, 
readers learn the story of Bert's corporate team, which locals call the Blue Company, 
and hear Bert's side of his attempted long-distance relationship. The story has been 


“performed” twice, in 2001 and 2002, as a series of messages sent out to an audience. 
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At each telling the messages included references to that year's current events, and 
the 2002 performance involved design collaboration with Rick Valicenti. 

A rather different narrative experience awaits readers of Michael Betcherman and 
David Diamond's The Daughters of Freya (2004-05). While the texts of both it and 
Blue Company are chatty and email-like, Betcherman and Diamond's piece largely 
follows mystery novel conventions in relating the story of a reporter investigating 
a California sex cult and, eventually, a murder. The forms of the stories are also 
rather different. The Daughters of Freya is not performed at particular times, but 
always available to new readers, who receive their first message a few hours after 
signing up at the project's website. Also, while Blue Company maintains a close 
correspondence between the messages sent by the characters and those received by 
readers, The Daughters of Freya often includes messages from multiple characters in 
a single email received by readers, with datestamps driven by the story?s timeline 
rather than the time of reading. Here email is used to provide context and produce 
suspense (as readers accustomed to revealing the mystery at their own pace must wait 
for messages to arrive, including some significant spacing out of the story's climax) 
but not to mimic the experience of being cc'ed on correspondence. Finally, while 
Betcherman and Diamond do not “perform” the work anew at intervals, responding to 
current events, they have taken advantage of their publishing platform's potential for 
revision. In email correspondence, Betcherman has outlined for me three significant 
revisions in the story's ending carried out during the first half of 2005 — all of which 
were suggested by responses to a survey sent out to each reader at the conclusion of 


the story. The ability to tinker with a story even after publication, of course, has also 
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long been a dream of writers. 

But whatever the literary interest of this work, and however much it fulfills long- 
standing ambitions of writers, from a computer science standpoint it is utterly trivial. 
While writers and literary critics may see important differences between Blue Com- 
pany and The Daughters of Freya, on an algorithmic level they are essentially the 
same — human-written chunks of text sent to readers via email — and totally un- 
interesting. Similarly, the vector animation techniques on display in Dakota are so 
well-understood that they are packaged into a mass-market tool like Flash. The in- 
novation is all in the text and its combination with sound and movement. There is 


no algorithmic innovation to spark interest from a process-oriented perspective. 


Interactive characters and generated stories 


For their part, computer scientists can claim some literary successes that stem from 
process-intensive approaches. The interactive character Eliza/Doctor, for example, 
was created by Joseph Weizenbaum at MIT in the mid-1960s — and has been con- 
tinually read and ported to new computing platforms for four decades (Weizenbaum, 
1976). This textual character is a parody of a Rogerian therapist, and involves an au- 
dience member in the experience of a simulated initial interview. The results can be 
surprisingly engaging, as Eliza/Doctor uses simple rules to turn the audience mem- 
ber's typewritten statements back in the form of open-ended questions and prompts 
to talk further. 

While not all writers would be prepared to recognize Eliza/Doctor as literature, 


most can accept the idea that presenting a character through conversation with the 
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audience guided by previously-authored texts and rules — rather than through recita- 
tion of unvarying text — has the potential to be literary. Yet “believable characters,” 
the research area that Eliza/Doctor helped launch, contains work that most writ- 
ers would find without relation to their own. The oddness of this circumstance is 
compounded by the fact that the gap between the believable character work that 
many writers might recognize and that which most would not is nearly invisible 
when viewed through certain computer science lenses. In fact, simultaneous and 
closely-related projects by the same research group could occupy the gap's two sides. 

For example, let us take the Oz Project at Carnegie Mellon University. This 
project, led by Joe Bates, sought to create believable characters that could play roles 
in interactive dramas. The Oz Project's early 1990s Lyotard was a textual piece, 
presenting a simulated house cat with an intriguing personality living in a simulated 
apartment. To experience the piece an audience member would read descriptions 
(to understand the world and the cat) and write them (in order to take actions and 
perhaps befriend the cat). While not a traditional form of writing, the output of the 
system is certainly recognizable to writers. But the Oz group simultaneously pursued 
another work, The Edge of Intention, which featured no text at all. Though, from 
a computer science perspective, both Lyotard and The Edge of Intention were part 
of the same research project, the latter was presented entirely as real-time animation 
(Bates, 1994). 

From the average writer's perspective these projects are utterly different. One is a 
kind of interactive writing, and the other is a kind of interactive animation. But from 


the point of view of the Oz Project, they are fundamentally the same. The real work 
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of the Oz Project was in developing computer processes that would allow for rich 
interactions with author-defined personalities over time. In Lyotard the personality- 
defining processes were hooked up with processes for describing the character and 
world through text, as well as for accepting audience interaction through text. On 
the other hand, in The Edge of Intention the personality-defining processes were 
hooked up with processes for displaying the characters and world through animation, 
as well as for accepting audience interaction via a mouse. But their primary work, 
from the point of view of the project, remained the same: portraying interesting 
fictional personalities, a fundamentally literary goal. 

A similar gap exists in the area of story generation. Story generation systems, from 
a computer science perspective, are primarily methods for generating story structures. 
Beyond that, some systems, such as Selmer Bringsjord and David Ferrucci's Brutus, 
are constructed with generation of literary text for these structures as an important 
part of their operations (Bringsjord and Ferrucci, 2000). But others focus entirely 
on generating story structures, with text output nothing more than a report on the 


structure, as is apparent in this simple example from Raymond Lang's Joseph (1999): 
once upon a time there lived a dog. one day it happened that farmer 


evicted cat. when this happened, dog felt pity for the cat. in response, 
dog sneaked food to the cat. farmer punished dog. 


Writers versus computer scientists? 


In later chapters we will look in depth at story generation. For now, let us consider the 


differences between the above approaches to surface output and underlying processes. 
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The differences are so great that — within the field of digital literature, and also in 
the digital arts more generally — it has often been difficult for those within the arts 
and sciences to appreciate the value of the others? contributions, though profoundly 
creative and interesting work has emerged from both directions. 

In part this may be due to the fact that the differences can be read as a stereotype: 
writers innovate on the surface level, on the “reading words” level — while computer 
scientists innovate at the process level, the algorithm level, perhaps without words at 
all. But as soon as this stereotype is expressed directly it also becomes apparent that 
it must be taken apart. And we have much more to point toward, for this purpose, 
than the work of those few writers who are also computer scientists. 

In the 20th century alone there is a large body of work by writers, many of whom 
never used computers, that is explicitly process oriented. This includes work associ- 
ated with groups such as the Oulipo, Dada, and Fluxus, as well as work associated 
with individuals such as William S. Burroughs, Jackson Mac Low, and John Cage. 
Some of this work will be discussed in more detail over the next few chapters. But 
simply acknowledging its existence allows us to see that writers — not just computer 
scientists — have viewed innovation at the process level as a site for literary creativ- 
ity. Adopting a vocabulary of “writers versus computer scientists” will not help us 
as we seek to move forward. 

Another way of putting this is that the foregoing section's stereotypes are only 
tenable if we limit our view to the most obvious examples from the field of digital 
literature. Exposed to the broader field of literary work they are quickly revealed as 


inappropriate. 
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Process intensity and computer games 


In addition to writers and computer scientists, there is a third group, overlapping 
and yet in some ways distinct, investigating digital literature. Chris Crawford, whose 
notion of “process intensity” was noted earlier, emerges from this third group: game 
designers. 

Let us return to Crawford 's notion. He seeks not only to provide clarifying vocab- 
ulary when discussing “process intensity,” but also to provide guidance for software 


developers, especially game designers. He writes (Crawford, 1987): 


Processing data is the very essence of what a computer does. There are 
many technologies that can store data: magnetic tape, punched cards, 
punched tape, paper and ink, microfilm, microfiche, and optical disk, 
to name just a few. But there is only one technology that can process 
data: the computer. This is its single source of superiority over the other 
technologies. Using the computer in a data-intensive mode wastes its 
greatest strength. 


Because process intensity is so close to the essence of “computeriness”, it 
provides us with a useful criterion for evaluating the value of any piece of 
software. 


Crawford's argument is, on some level, quite compelling. Certainly it would be 
hard to argue against the notion that the computer is a technology for carrying out 
processes — as demonstrated by the short history of stored program electronic digital 
computers in the previous chapter. It would also be difficult to argue that, in digital 
art and media, our great challenge is anything but grappling with what computational 
processes make possible. 

At the same time, however, it would be irrational to argue that we are at anything 


other than the “grappling” stage. We're only beginning to learn to author artistic 
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processes. 

Given this, the most effective surface results — and therefore many of the most 
effective audience experiences — are often, at this time, achieved by data-intensive 
approaches. This is true not only in the field of digital literature, but also in the 
overlapping field to which Crawford's essay is addressed: computer game design. 
While early computer games were often primarily composed of process definitions, 
with relatively simple image and sound data, today's computer games have become 
focused on more and more refined surface representations. As a result, many major, 
“AAA” game releases are dominated by data, often containing as many image and 
sound “asset files” as lines of software code.* 

Certainly not all computer games are literary, but some of the best-known works 
of digital literature have emerged from the field of computer games. And, in fact, 
computer game authors are a population within the field of digital literature that has 
shared members with both the computer science and writer populations. Take, for 
example, the field of “interactive fiction” or “text adventures,” often associated with 
Zork and its publisher, Infocom. In these games, which first emerged in the 1970s, 
a world (containing objects and often characters) is described textually. As with 


Eliza/Doctor (or the Oz Project's Lyotard) an audience member can “write back” 





10 This observation was made by Scott Cronce, of major game publisher Electronic Arts (EA), 
during an “academic summit” sponsored by the company for the 2005 Game Developer's Conference. 
An important exception to this is provided by the games designed by Will Wright, such as Sim City 
and The Sims. These buck the industry trend toward massive numbers of data files, and instead 
define compelling processes for audiences to interact with. Wright's games are also nonviolent and 
among the best selling games of all time. Wright spoke at the same gathering as Cronce, with both 
arguing that the current, data-intensive model of game development is not a viable direction for 
future growth. (Mateas, 2005) 
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to the system, usually to control the actions of a particular “player” character in the 
world. 

The basic processes underlying these games were developed in a computer science 
research context before they became the basis of some of the best selling software 
of the 1980s. And they are certainly more process intensive than the branching- 
story “Choose Your Own Adventure” books with which they are sometimes (un- 
fairly) compared. However, while Infocom employed a number of accomplished 
writer/programmers, they also worked directly with more traditional writers such 
as Douglas Adams, and similar processes to those used in the Infocom products now 
form the basis for the work of a world-wide community of non-commercial interactive 
fiction authors (only a small number of whom have computer science or program- 
ming training). This is possible because, by comparison with approaches such as 
structure-oriented story generation, interactive fiction is actually a rather data in- 
tensive undertaking. Much of the work of making a successful interactive fiction, 
especially now that the basic processes have become formalized, is simply the work 
of writing and structuring the fictional world. Creating innovative new processes is 
not required. 

Given its focus on interaction, interactive fiction will not be a major topic of this 
study, nor will the game industry's even more asset-heavy current versions. However, 
what I hope this short gloss makes clear is that the computer game community is 
another participant in the field of digital literature — and one that, like most writers, 
is primarily concerned with near-term audience experience and has therefore largely 


chosen data-intensive approaches. 
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The limits of process intensity 


Crawford's notion of process intensity is quite valuable, and it will be employed in 
later chapters. But in this study it is seen as more useful to us as a descriptor, rather 
than as a “criterion for evaluating the value of any piece of software.” Further, as we 
employ Crawford's concept in the field of digital literature, there are three important 
caveats to bear in mind. 

First, we should acknowledge that data-intensive approaches are being used not 
because people don't understand that the computer is a machine for carrying out 
processes, but because in many cases these techniques currently produce better re- 
sults. We are only beginning to develop processes for artist purposes — and it is 
not easy. While it is possible, for example, to define processes that will generate 
images, there are many unsolved research questions in this area, especially if it needs 
to be done quickly. This is why, for now, game companies primarily take the eas- 
ier and more reliable route of employing large numbers of pre-created image files 
in combination with less ambitious processes. Generating literary elements is even 
more difficult. Audiences can be unforgiving of computer-driven characters that lose 
conversational threads, computer-generated sentences that follow rules toward inco- 
herent or bland results, and computer-generated stories that are less interesting than 
mediocre human-authored ones. This is why interactive fiction games, animated web 
poems, and email narratives have all focused on data-intensive solutions. 

Second, just as there is nothing wrong with data-intensive approaches to paper- 
based literature, there is nothing wrong with such approaches in a computer con- 


text. As we increasingly read and write on the computer, writers will put interesting 
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text into this context — as email messages, calendar entries, blogs, cell phone text 
messages, and so on. There will also likely be more procedural versions of these 
(email novels to which we can write back, word processing files equipped with artistic 
macros) but the more process-intensive ones will not be inherently superior to the 
more data-intensive examples. 

Third, and perhaps most importantly, the concept of “process intensity” alone 
does not focus our attention on what is interesting from a literary perspective. For 
example, we could imagine three versions of the Oz Project's Lyotard — presented 
through text, asset-file graphics, and procedurally-generated graphics. Each would 
have a different level of Crawford's process intensity. But the literary processes, which 
determine the behavior of the interactive believable character and simulated world, 
would be exactly the same. Our interest lies not in process intensity as such, but in 
the intensity of literary, expressive processes. 

Recall the discussion of adjustable processes in computer games from the previous 
chapter (beginning on page 14). When games allow players to lower the level of 
graphical detail — lowering the graphical process intensity — the experience of the 
game, the gameplay, is presented as the same. On the other hand, when players are 
allowed to lower the level of performance of the game's AI — lowering the Al process 
intensity — this is presented as changing the gameplay (often as lowering the game's 
difficulty level). This demonstrates that not all processes contribute equally to the 
experience and meaning of digital works. 

In the case of the Oz project, the expressive processes are those that determine 


character behavior. If those were altered, and made less intense, this would change 
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the intensity of the expressive processes. On the other hand, if the level of graphical 
process intensity changes this has little impact on the main expressive processes at 
work. Similarly, a version of the Oz system could be built that, through certain 
interactions, could trigger massive database manipulations. The process intensity 
would go through the roof, but the intensity of the project's expressive processes 
would remain the same. 

The remainder of this study is focused on works characterized by the intensity of 


their expressive processes. 


Understanding the Love Letter Generator 


Now, with some sense of the form of the field of digital literature, it is possible for 
us to return to Strachey's love letter generator. We can now attempt to read the 
first work of digital literature in a manner informed by the concerns that have been 
clarified over the more than five decades since its creation. 

First, let's remind ourselves of what the generator's output looked like. Here is 
another example from Strachey's Encounter article: 


Honey Dear 


My sympathetic afftection beautifully attracts your aftectionate enthusi- 
asm. You are my loving adoration: my breathless adoration. My fellow 
feeling breathlessly hopes for your dear eagerness. My lovesick adoration 
cherishes your avid ardour. 


Yours wistfully 
VL:U:-G: 
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The hopeless romantic of these letters — “M. U. C.” — is the Manchester Uni- 
versity Computer, or Mark I. M. U. C. played the part of a love letter author by 


carrying out the following process, as outlined in the same article: 


Apart from the beginning and the ending of the letters, there are only 
two basic types of sentence. The first is “My — (adj.) — (noun) — 
(adv.) — (verb) your — (adj.) — (noun).” There are lists of appropriate 
adjectives, nouns, adverbs, and verbs from which the blanks are filled in 
at random. There is also a further random choice as to whether or not 
the adjectives and adverb are included at all. The second type is simply 
“You are my — (adj.) — (noun),” and in this case the adjective is always 
present. There is a random choice of which type of sentence is to be used, 
but if there are two consecutive sentences of the second type, the first 
ends with a colon (unfortunately the teleprinter of the computer had no 
comma) and the initial “You are” of the second is omitted. The letter 
starts with two words chosen from the special lists; there are then five 
sentences of one of the two basic types, and the letter ends “Yours — 
(adv.) M. U. C.” 


The generator's surface 


As Jeremy Douglass notes (2000), the love letter generator has often been used as an 
example in discussions of queer identity, rather than considered carefully as a literary 
project. Certainly there are reasons for this — Turing and Strachey were both gay, 
and at least Turing openly so, at a time when homosexuality was illegal in England. 
It might also seem from widely-reproduced outputs of the generator (e.g., that found 
in Hodges) that it was a love letter generator that “could not speak its name” — the 
word “love” being conspicuously absent. 

Equally revealing, however, is the almost complete lack (with the exception of 


Douglass's contribution) of attempts to read the generator's output in literary terms 
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— to give it close consideration. Certainly our existing tools for literary work are 
sufficient to perform a reading of surface text, of output, even if this study has cast 
come doubt on their sufficiency for considering data and process? No, a lack of 
means for approaching the generator's output does not seem the likely cause of this 
silence. Rather, to put it baldly, my guess is that scholars have not approached the 
generator's output from a literary perspective because it simply does not feel human. 
The letters preserved by Strachey are not texts that anyone would write — and yet, 
unlike certain modernist and postmodern texts, they do not achieve a paradoxical 
interest value based on the knowledge that they were written by a person. These 
inhuman texts were, in fact, produced by a machinic process. 

In part the inhuman feeling comes from the mismatched awkwardness of declara- 
tions such as “You are my avid fellow feeling” (in the letter reproduced at the start 
of this chapter). But it may be even more dramatic in the patterns of permutational 
word repetition in the example above. In the first sentence we have “sympathetic 
aftection” followed by “affectionate enthusiasm.” In the next sentence, “loving adora- 
tion” followed by “breathless adoration.” The following two sentences echo previous 
phrasing with “breathlessly hopes” and “lovesick adoration” — after which the “let- 
ter” is abruptly over. 

The generator's texts do not seem like a fumbling attempt to express love, but like 
something other than expression, something that is not working toward the pleasures 
traditionally found in reading. How, then, can we read the love letter generator? As 
was suggested earlier, I believe that, if we're going to find something that interests 


us, in this project that “greatly amused” Strachey and Turing, we're going to have 
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to look beneath the surface. 


The generator's data 


How can we read the generator's data — its sentence templates and, especially, the 
“lists of appropriate adjectives, nouns, adverbs, and verbs” available to the sentence- 
assembling process?!! These are not a traditional text, but rather represent the 
spectrum of possibilities for each open slot in the generator's structure, for each run 
of the program. 

One thing we can do with the data is compare it against the ideas we have de- 
veloped while reading examples of surface text. By looking at the complete list of 
available words (table 2.1) we can see, for example, that the absence of the word 
“love” from printed examples of the generator's output was simply an accident of 
randomness, rather than a deliberate, telling lack built into the system. The gener- 
ator's complete vocabulary contains “love,” “loves,” “loving,” “lovingly,” “lovesick,” 
and “lovable.” But, given the nature of randomness, we might not have realized this 
— even after reading many examples of the generator's surface output — without 
examining the system's data. 


Another thing we can do with data is look for patterns in it. Data may be 





1 Of course, for many works of digital literature it is a challenge to get access to the data. While 
Strachey, like most computer scientists, published an account of his project's processes, it is rare 
to publish a projects complete data. In this case, the relevant papers of Strachey's at the Oxford 
Bodleian Library were consulted. These contain a complete program listing for the generator, 
from which its data was extracted. Unfortunately, most early work in the digital arts was not 
so scrupulously preserved. The importance of preservation issues for digital literature, combined 
with some practical suggestions for current authors, is the subject of Acid-Free Bits (Montfort and 
Wardrip-Fruin, 2004). 
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Table 2.1: Love Letter Generators Data 


Category 


Words 





Adjectives 


Nouns 


Adverbs 


Verbs 


Letter Start 


anxious, wistful, curious, craving, covetous, 
avid, unsatisfied, eager, keen, burning, 
fervent, ardent, breathless, impatient, loving, 
lovesick, affectionate, tender, sweet, 
sympathetic, fond, amorous, erotic, 
passionate, devoted, dear, precious, darling, 
little, lovable, adorable 

desire, wish, fancy, liking, love, fondness, 
longing, yearning, ambition, eagerness, 
ardour, appetite, hunger, thirst, lust, passion, 
aftection, sympathy, fellow feeling, 
tenderness, heart, devotion, fervour, 
enthusiasm, rapture, enchantment, 
infatuation, adoration, charm 

anxiously, wistfully, curiously, covetously, 
eagerly, avidly, keenly, burningly, fervently, 
ardently, breathlessly, impatiently, lovingly, 
aftectionately, tenderly, fondly, passionately, 
devotedly, seductively, winningly, beautifully 
desires, wishes, longs for, hopes for, likes, 
clings to, wants, hungers for, thirsts for, 
yearns for, lusts after, sighs for, pines for, 
pants for, woos, attracts, tempts, loves, cares 
for, is wedded to, holds dear, prizes, 
treasures, cherishes, adores 

dear, darling, honey, jewel, love, duck, 
moppet, sweetheart 
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carefully authored or selected in order work with processes in particular ways. It 
may have tell-tale absences or conspicuous repetitions. In this case, however, what 
seems apparent is a lack of careful shaping. Strachey wrote, in his Encounter article, 
that “the vocabulary is largely based on Roget's Thesaurus.” Here we can see that 
the data looks like a verbatim transcription from that source. Here we can begin 
to ask ourselves questions, but only preliminary ones. For what sort of processes 
would one choose to copy the data from a thesaurus, rather than carefully select each 
element? Is this data a determining factor for the work? What would happen if it were 
substituted for thesaurus entries associated with different interpersonal relationships, 
or with an entirely different topic? 

As these preliminary questions reveal, most of what it might be interesting to 
interpret about this data can only be considered in the context of the generator's 
processes. In general, in process-intensive work, data is interesting primarily when 
considered for how it will be employed in processes. And so it is to this challenge, of 


interpreting the generator's processes, that we must turn. 


The generator's processes 


How can we begin to read processes? That is to say, how can we begin to interpret 
what a work does, what it can do, instead of only what it says? 

First we would need to identify some features of the work's processes from which 
to begin our interpretation. One approach to this is comparison — considering two 
or more processes together, and seeing which shared and differing features emerge. 


This is the approach taken here. We will begin by comparing Strachey's love letter 
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generator with two other works in which processes play a significant role: one is an 
influential literary work, and the other is the contemporaneously-developed version 
of Strachey?s checkers playing program. The features that emerge through this com- 


parison, when considered in context, will form the starting point for interpretation. 


One Hundred Thousand Billion Poems 


The Oulipo (Ouvroir de Littérature Potentielle, or Workshop for Potential Litera- 
ture) was founded in 1960 by Raymond Queneau and Francois Le Lionnais. It was 
founded after Queneau ran into Le Lionnais, a friend of his, while at work on a difficult 
and unusual project he did not feel he had the strength to continue (Lescure, 1986). 
Queneau reports, “he suggested that we start a sort of research group in experimen- 
tal literature. That encouraged me to continue working” (p. 32). The project was 
Queneau's Cent mille milliards de poems, or One Hundred Thousand Billion Poems 
(1961). 

This work consists of 10 sonnets, each of 14 lines. While one might expect, then, 
that this work would be more suitably titled Ten Poems, there is something in the 
construction of each poem that causes the number of potential poems to be much 
larger than ten. To wit: a reader can construct alternate poems by reading the first 
line of any of the original sonnets, followed by the second line of any sonnet, followed 
by the third line of any sonnet — and find that the whole work is artfully constructed 
so that any reading of this sort produces a sonnet that functions syntactically, met- 
rically, and in its rhyme scheme. This is made easier by the way the poem is printed, 


with each poem on a page cut into fourteen strips that can be turned individually. 
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Each line of poetry rests on an individual strip of paper, so that new poems can be 
composed by turning strips to reveal lines originally used for one sonnet or another. 

This process, carried out by the reader, creates a dizzying number of possibilities. 
When one chooses which of the first lines to read, there are ten possibilities. Next, 
having read one of the ten first lines, one can choose any of the ten second lines 
— meaning that there are one hundred (10 x 10) possibilities for reading the first 
two lines. After reading a second line, one can choose any of the ten third lines 
— meaning that there are a thousand (100 x 10) possibilities for reading the first 
three lines, and so on. This type of work is called “combinatorial literature” and 
Oulipo member Harry Mathews, while incorporating a quotation from earlier writing 


by fellow Oulipian Claude Berge, writes of combinatorics that: 


[its object is] the domain of configurations, a configuration being the pre- 
set arrangement of a finite number of objects, whether it concerns “finite 
geometries, the placement of packages of various sizes in a drawer of lim- 
ited space, or the order of predetermined words or sentences.” 


Arrangement, placement, order: because these are the materials of Oulip- 
ian combinatorial research, what generally results can be called rearrange- 
ment, replacement, reordering, subsumed by the generic term permuta- 
tion. (Mathews and Brotchie, 1998, 129) 


While combinatorial literature is concerned with the arrangement of fixed ele- 
ments, it is important to note that not all the elements have to be employed in each 
result — not all the packages have to fit in the drawer. Certainly a major feature of 
Queneau's One Hundred Thousand Billion Poems is that only 14 of its 140 lines are 
used in the production of any of its potential sonnets. And from this we can see that 


Strachey's love letter generator is a work of combinatorial literature — one of those 
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that preceded the first work of the Oulipo, a historical circumstance the Oulipo came 
to call “anticipatory plagiary.” 

What can we say about the love letter generator's processes, in comparison with 
those of One Hundred Thousand Billion Poems? To begin, we can observe that its 
processes are random, and carried out by a computer, whereas Queneau's Poems are 
always the product of reader selection. The generator's processes are also quite a bit 
more combinatorial than those of the Poems. The generator carries out a combinatory 
process in the selection of nearly every word when creating sentences that follow the 
pattern of “My — (adj.) — (noun) — (adv.) — (verb) your — (adj.) — (noun).” In 
each of these word selections, the number of potential choices is also not small. For 
example, there are 31 possible adjectives that could occupy the open space after the 
sentence's initial “My” and 29 possible nouns for the slot following that, creating 899 
possibilities for just the first three words of each sentence of this form (424,305,525 for 
a complete sentence).!'? One Hundred Thousand Billion Poems, on the other hand, is 
only combinatory on a line-by-line basis, and there are only ten options for each line. 

But at least as important as the degree of combinatorial operation in these works 
is the nature of what is being permuted — the data that is being arranged by these 
processes. In Queneau's piece, the chunks of data are quite large: full lines of the 


sonnet. Because the chunks of data are large it is possible, though it requires a high- 





12Given that each love letter contains five sentences, each of one of the two types, one can 
add together the number of possibilities for each of the two sentence types and then take the re- 
sulting number to the fifth power in order to determine the number of possibilities for the main 
body of the letter (leaving aside the letter's opening and closing words). I calculate this to be 
753,018,753,081,800,000,000,000,000,000,000,000,000,000 — a number much greater than one hun- 
dred thousand billion. 
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wire act of writing, for Queneau to enforce structure through the artful construction 
of each line. One Hundred Thousand Billion Poems not only maintains scansion 
and rhyme in all its permutations (which only requires constructing 10 sonnets with 
identical schemes in these dimensions) but also syntactic sense, which requires artful 
parallel constructions across the sonnets. Further, the different original sonnets have 
different topics, but evocative, potentially-related imagery which enriches the possible 
reader experiences. As Stephen Ramsay characterizes reading One Hundred Thousand 


Billion Poems: 


Though one might create a poem using random selections, there is nothing 
inherently aleatory about the process.... Rather, one consciously and 
deliberately looks for interesting combinations of lines and poetic effects. 
In building my sonnet, I found myself unable to resist the urge to make the 
wild horses of the Elgin marbles seize “the thumb- and finger-prints of Al 
Capone.” ... One has the clear sense of having discovered something with 
these combinations — of having liberated certain energies in the work — 
while at the same time having beat Queneau at his own game. (Ramsay, 
2003, 54) 

Once again, we see the potential power of data-intensive approaches. In contrast, 
the love letter generator achieves greater combinatorial possibility by working with 
smaller units of data. It carries out more operations on smaller units — it is process 
intensive. And because the data units are small (individual words), and because the 
selection included in the work is not carefully shaped in any obvious way (e.g., only 
rhyming nouns), the love letter generator does not seem to achieve shape through 
data the way that One Hundred Thousand Billion Poems does. 


It might be possible for the generator to, instead, achieve shape through process. 


For example, the processes could be elaborated to avoid particularly poor results 
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(e.g., excessive repetition) or to enforce particularly pleasing patterns of some sort 
(e.g., linked imagery between sentences, alliteration, or even rhyme). Though some 
of these might require slightly more elaborated data, this does not seem the most 
important facet of the fact that more complex processes were not used to give more 
structure to the generator's output texts, to make them better love letters. So let 
us remember this fact, and return to it after comparing the generator with another 


example process. 


Strachey*s checkers-playing program 


Strachey completed the first version of his checkers-playing program for the Mark 
I before he began work on the love letter generator. His original design for the 
program focused on an approach from game theory that is now relatively well known 
as a “game tree search” or “minimax” algorithm. Strachey describes it as follows in 


his Encounter article: 


In the scheme actually used the machine “looks ahead” for a few moves 
on each side. That is to say that it selects one of its own possible moves, 
discovers all the legal replies for its opponent, and tries them out one by 
one. For each combination of its own move and its opponent's reply, it 
then finds all its own possible second moves and so on. As there are, on an 
average, about ten legal moves at each stage of the game, the number of 
moves it has to consider gets very large indeed if it is to look ahead more 
than a very few steps. After a certain number of these stages (the number 
being determined by the time taken) the machine evaluates the resulting 
position using a very simple scheme. It notes the value of this position 
and ultimately chooses the move which leads to the best possible position, 
always assuming that its opponent makes the best of the possible moves 
open to him. (p. 27) 
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With this much description we can begin to draw some basic distinctions between 
the love letter generator and the checkers-playing program. Some distinctions, such 
as that one set of processes selects words while the other selects game moves, are so 
straightforward that most of them will be passed over. More important is a com- 
parison, for example, of how these selections are made. Let's start at the beginning. 
When the checkers-playing program begins the process of selecting a move, it starts 
by looking at the current state of the board, and then projects forward. This means 
that the program must constantly keep track of what is often called “state informa- 
tion” — it must “maintain state” over the course of the game — in order to know 
where to begin. And in selecting each move it projects forward many possible states, 
with the choice based on the best possible outcome of a series of moves. 

By contrast, what kind of state does the love letter generator maintain? It must 
know what stage in the generation process is taking place — beginning, ending, or 
main body. It must also know when two sentences of the form “You are my — 
(adj.) — (noun)” appear consecutively in the main body, so that it can follow the 
rule requiring that “the first ends with a colon ... and the initial “You are” of the 
second is omitted.” But the determination of which sentence types will be used is 
random, and so is the selection of the word that will fill each open slot for an adjective, 
noun, adverb, or verb. None of what has already been decided plays into the current 
decision, and certainly no forward projection of future possibilities is carried out. 

Why is this? Certainly it is not because the computer was incapable of it, or 
Strachey was incapable of it — the checkers-playing program, after all, was written 


before the love letter generator. In part it may have been that the mathematical 
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operations of playing a zero sum game were more amenable to an approach that 
made complicated decisions based on state information. But more important than 
speculation, for our purposes, is the simple fact that a state-free design was chosen 
for the generator?s processes. 

Before discussing this issue further, however, let's look at another facet of the 
checkers-playing program. While the game tree search algorithm it used was not 
unknown at the time of Strachey?s work, in the context of real-world computer check- 
ers (in which speed issues required a limited number of projected future moves) it 
produced an unexpected behavior. Strachey reported this unexpected result to com- 
puter scientists at the Association for Computing Machinery's national meeting in 


1952, and then put it in layman's terms for his Encounter article: 


There is, however, one feature of the machine's game which was quite 
unexpected and rather interesting. The way in which the machine values 
the positions it reaches when looking ahead is extremely crude. It counts 
each man as one point and each king (being obviously worth more than an 
ordinary man) as three points; the value of any position is the difference 
between its own points and its opponent's points. A large number of the 
positions it examines will, of course, have the same value, and it chooses 
between these at random. 


Suppose now its opponent has a man in the seventh rank so that he is 
about to make a king in his next move, and the machine is unable to 
stop him. The machine will effectively lose two points at its opponent's 
next move, and a human being would realise that this was inevitable and 
and accept this fact. The machine, however, will notice that if it can 
sacrifice a single piece its opponent must take this at once. This leads to 
an immediate loss of only one point and, as it is not looking far enough 
ahead, the machine cannot see that it has not prevented its opponent 
from kinging but only postponed the evil day. At its next move it will 
be faced with the same difficulty, which it will try to solve in the same 
way, so that it will make every possible sacrifice of a single man before it 
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accepts as inevitable the creation of an opponent's king. (p. 28) 


This type of behavior, in which there are complex (and likely some unexpected) 
results from the interactions of simple rules, is often in the digital arts called “emer- 
gent behavior.” In this case, the behavior that emerges is not desirable (it leads to 
bad checkers-playing) but it is notable for being both a completely logical outcome 
of the design of the system and an outcome that even the system's author did not 
foresee. Part of what sparks interest in process-intensive digital literature is the pos- 
sibility it seems to hold out for more positive forms of emergence — which will be 
able to surprise not only the system authors, but also the audience. 

Encounter readers were not given an explanation of how Strachey sought to ad- 
dress this problematic result, but he did give more information to the audience at the 


ACM meeting (Strachey, 1952a): 


In order to avoid this difficulty, the second strategy was devised. In this 
the machine continues to investigate the moves ahead until it has found 
two consecutive moves without captures. This means that it will be able to 
recognise the futility of its sacrifice to prevent Kinging. It is still necessary 
to impose an over-riding limit on the number of stages it can consider, 
and once more, considerations of time limit this. However, as no more 
[sic] is continued for more than two stages unless it leads to a capture, 
it is possible to allow the machine to consider up to four stages ahead 
without it becoming intolerably slow. This would mean that it would 
consider the sacrifice of two men to be of equal value to the creation of an 
opponent's King, and as there is a random choice between moves of equal 
value, it might still make this useless sacrifice. This has been prevented 
by reducing the value of a King from 3 to 2f. (p. 49) 
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What is the generator's game? 


The above gives some indication of the level of complexity that Strachey?s curiousity- 
driven (rather than NRDC-assigned) Mark 1 programs were able to achieve. Given 
this and our previous discussion, a series of observations present themselves for our 
interpretation. It is a potentially puzzling combination of facts. How should we con- 
sider the love letter generator?s deliberate simplicity, its statelessness and randomness, 
and the fact that its vocabulary is a transcription from a thesaurus? This may not 
seem a puzzling set of facts on their own, but it seems one once we're reminded of the 
fact that this is not a project tossed off and then forgotten. In addition to Strachey 
and Turing's amusement at the time, Strachey also wrote of the love letter generator 
for Encounter two years later, and the project made enough of an impression that it 
has appeared in many accounts of his work, Turing's work, and early work with the 
Mark 1. 

Given all this, the most compelling interpretation, to me, is that the love letter 
generator is a process designed to fail. Just as, when Polonius enters the stage, the 
audience waits for the next spectacularly vapid truism to escape his lips, so Strachey 
and Turing watched the teleprinter, knowing the processes that were going on within 
the Mark I, and waiting for the next formulaic jumble of those words most socially 
marked as sincere in mainstream English society. To put it another way, the love letter 
generator — in the way it operated — was a blunt parody of normative expressions 
of desire. It played the role of the lover as an inept spouter of barely-meaningful, 
barely-literate sentences, composed with repetitive randomness while one finger still 


rested in the thesaurus. Further, the examples chosen for preservation by Strachey 
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appear to be those with particularly strong surface markers of mindless permutational 
patterns. 

As a linguistic process designed to fail spectacularly and humorously, through 
randomness, the love letter generator is certainly not alone. Perhaps the best-known 
examples are the Mad Libs books, first published later the same decade (1958) by 
Roger Price and Leonard Stern (Stern, 2001). Like the love letter generator, Mad 
Libs are defined by a process that fills in adjectives, adverbs, nouns, and verbs within 
given sentence structures. But Mad Libs can also request exclamations, geographical 
locations, numbers, colors, parts of the body, and so on. And most of the words in 
any given Mad Libs text are not blank, but instead form a skeleton of a traditional 
text on a particular subject, with only strategic words left open. Further, Mad Libs 
are not combinatorial. Rather than all the possible words of each open sort being 
part of the work, Mad Libs fills in its blanks by drawing on the suggestions of players 
who do not know the subject matter of the text for which they're providing material. 
That is to say that, for example, rather than choosing among verbs on a provided 
list, players of Mad Libs are free to use any verb they can recall. 

The result is a process that everyone knows will fail. One player, who has seen the 
terms needed for the Mad Libs text, asks the others for the necessary types of words. 
The players then joyfully call out the most potentially inappropriate suggestions they 
can imagine. There is some humor in this, but much more in the anticipation — in 
waiting to see how this gathering of data will, when combined with the given data 
through the Mad Libs process, result in a ridiculous text. The anticipation is released 


as the final text is read, often to great laughter. But no one keeps the resulting texts 
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around afterward. They're only funny as the anticipated, yet unpredictable, result 
of the Mad Libs process. And certainly, in this way, the love letter generator's 
products are more like Mad Libs than like the carefully-crafted linguistic foolishness 
of characters like Polonius. However, it is important to note that Polonius is meant 
to represent a certain type of fool, while Mad Libs are not meant to represent any 
type of previously-recognizable processes. Mad Libs are a humorous process, but not 
a representation — while 1 read the love letter generator as a representation. 

Here, I believe, we come to a point at which we can understand the love letter 
generator, the first experiment in digital literature. It is a process designed to fail, 
employing a thesaurus-based set of word data, and which can result in particularly 
inhuman surface texts (as seen in those selected for preservation by Strachey). We 
understand this combination in context — or, perhaps it is better to say two con- 
texts: the technical context of the early stored-program computer on which Strachey 
worked as well as the social context of 1950s computing culture and the increasingly 
homophobic larger English society. Taking all this together, we can see the generator 
as a parody, through its operations, of one of the activities seen as most sincere by 
the mainstream culture: the declaration of love through words. 

That is to say, I do not see the love letter generator as a process for producing 
parodies, but as itself a parody of a process. The letters themselves are not parodies 
of human-authored letters, rather the letter production process is a parodic repre- 
sentation of a human letter writing process. It is not a subtle parody, driven by a 
complex structures that circuitously but inevitably lead, for example, to the same 


small set of vapid sentiments stored as data. Rather, it is a brutally simple process, 
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representing the authoring of traditional society's love letters as requiring no memory, 
driven by utterly simple sentence structures, and filled out from a thesaurus. The 
love letter generator, in other words, was as complex as it needed to be in order to 


act out a broad parody. 


Plans for another generator 


Coming to an interpretation of the love letter generator, however, does not bring 
us to the end of its story. Strachey's papers in the Oxford Bodleian Library also 
reveal his plans for a second version of the love letter generator. Here, the parody 
would have been somewhat less broad. Also, while the sheer number of possible let- 
ters might not have been greater, the feeling of diversity would have been increased. 
Rather than each letter being an expression of inarticulate desire, the second version 
of the generator would have operated according to themes such as “write to me,” 
“answer my letter,” “marry me,” “stop seeing that man,” and “tell your mother.” 
Further, each output would have had a general style, such as “reproachful,” “yearn- 
ing,” “impatient,” “grateful” and “reminiscent.” Strachey's notes contain many 
sample sentences for different permutations, such as: 

I can't imagine why you are always seeing that man 

How can you be so cruel as not to stop seeing that man? 


Do I dare to ask you to stop seeing that man? 


Don't go on seeing that man or 1 shall never speak to you again 


His notes also provide many grammars for producing sentences along these lines. 


Here is a simple one: 
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the universe 
the world 
existence 
England 


beautiful 
lovely 
delicious 
exquisite 
wonder ful 
who is nearly as exciting as yOu. 
attractive 
charming 
lovable 
nice 
inspiring 


girl 
There is no | woman | in 
one 


Here is the ending of a more complicated sentence structure: 


in my arms 
you sitting beside me 
among my wires 
beautiful 


lovely De 
beastly OO tip tilted nose 


..25 | wretched | without ys curly hair 
A exquisite , 
miserable slender waist 


ib onde tapering fingers 
exciting E q 
: dimpled chin 
ae sparkling eyes 
charming Pp ge) 
lovable 


Strachey's plans are a pleasure to read through — with clever turns of phrase, 
amusingly structured processes and grammars, and occasional surprising nods (like 
the “among my wires” above) to the conceit that it is M. U. C. who declares love 
in these letters. But the project was abandoned, and Strachey's papers give no 


indication of the cause. Personally, my thoughts turn to the context of Strachey's 
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work. I cannot help but wonder if it all began to seem less funny in the face of what 
had happened to the man with whom Strachey had stood and laughed while the first 


generator went about its foolishness. 


When Strachey and Turing stood together in the summer of 1952, waiting for M. U. C.'s 
next pronouncement of love, Turing was on a forced program of hormone injections. 
Intended to render him impotent, they were also causing him to grow breasts and 
perhaps impairing his thinking. The injections were part of Turing's sentence — 
having been convicted, on March 31st of that year, of “gross indecency.” Turing and 
Strachey may have laughed at the half-witted pronouncers of heterosexual love so 
roughly parodied by the love letter generator, but that meant laughing at a society 
all too willing to reject, incarcerate, and hormonally alter them for the simple fact of 
their homosexuality. 

Two years after that summer, on the evening of June 7th 1954, Alan Turing took 


his own life. 


Revisiting Surface, Data, and Process 


We might say that Lytton Strachey's Eminent Victorians and Christopher Strachey?”s 
love letter generator are both humorous critiques of conservative elements in English 
culture and those who held them in overly high esteem — one work operating through 
what it says, the other through what it does. The challenge for digital literature, put 
one way, is to effectively operate on both levels. And yet, as we attempt to create and 


interpret such work, a view of its most basic operations encompasses three elements: 
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surface, data, and process.*? 


This is not necessarily obvious at first blush. Works of electronic literature can 
present surfaces to their audiences much like the surfaces of traditional literature, 
film, and other non-digital art forms. But writers who begin to work in this area 
immediately discover that the ground has changed. Accustomed to fashioning a 
text that their audience will read, accustomed to fashioning a surface, they find 
themselves, instead, the fashioners of data. The sentence templates and lists of 
words for Strachey's generator are quite different from traditional writing. Rather, 
they are textual elements, shaped to work with the generator's particular processes. 
If we consider only a selection of sample surface outputs, only a few results from the 
action of these processes on this data, we will have a quite partial view of the work. 
We may, in fact, learn as much about the decisions made during the selection of these 
examples of surface output (in the case of the generator, Strachey's) as anything else. 


It may also be tempting to think that, in the creation and interpretation of digital 





13 At the same time, it is also important to note that the categories of surface, data, and process 
cannot be cleanly separated. As revealed by the discussions of this chapter and the previous one, 
these elements are deeply dependent on each other — with data and process often shaped to work 
with one another, and both designed to produce the desired surface effects. We can usefully discuss 
digital works as being composed of these elements, but we should not confuse ourselves by believing 
it possible to disconnect them completely and still understand them. 

More formal objections to the distinction are also possible, as Chris Crawford points out: “Ex- 
perienced programmers know that data can often be substituted for process. Many algorithms can 
be replaced by tables of data.... Because of this, many programmers see process and data as inter- 
changeable.” But such arguments generally grow from approaching the issues at the level of minutia, 
rather than an attempt to think about what is important in interpreting digital work. As Crawford 
says of the case he mentions, “T'his misconception arises from applying low-level considerations to 
the higher levels of software design.” That said, there are approaches to thinking about the fuzzy 
boundary between data and process that are potentially more significant for our purposes here — 
such as algorithmic information theory. Unfortunately, a deeper consideration of such approaches 
will have to be set aside for future work. 
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literature, we need to deal with its digital nature “in essence.” This essence might lie 
in the presumed binary nature of digital information, or in the construction of modern 
computers from silicon wafers, electrical wires, magnetic and optical storage media, 
and so on. But, as discussed in the previous chapter, this is an illusion. “Digital” 
simply means “discrete” — and for the data and process definitions of digital litera- 
ture to be stored in discrete binary numbers does not determine anything interesting 
about them. Rather than needing to ponder the meaning of the digital, we are faced 
with a much more complex and time consuming task: learning to read, to interpret 
and understand, processes. And this will require, as a starting point, beginning to 
identify those features of particular processes that actually help us situate and grasp 
them. This chapter's approach to this has been comparison between processes, a 
method this study will continue to pursue in the coming pages. 

It has doubtless been noticed, however, by some readers, that surface, data, and 
process were not the only elements of this chapter's interpretation of the love letter 
generator. There was also a fourth element, one which it might not be appropriate to, 
in the same manner, identify as “part” of the work: its technical and social context. 
And certainly it is true that this chapter's interpretation would not have been possible 
without a consideration of context, a topic identified for its pervasive importance in 
the previous chapter. Its presence will continue to be strongly felt in the next chapter, 


as we dig more deeply into our consideration of processes. 


Chance and Choice 


Reading Randomness 


Both Christopher Strachey?s love letter generator and Raymond Queneau's One Hun- 
dred Thousand Billion Poems (Cent mille milliards de poéms) can produce a large 
number of texts — generate many aesthetic surfaces — by arranging a fixed set of ma- 
terials in different ways. But what determines the arrangements? A simple answer, 
in both cases, is “randomness.” The author leaves the arrangement up to chance. 

Yet, at the same time, the mechanisms of chance, the operations of randomness, 
are quite different between the two works. In one case we have a decision left up to 
random selection by the computer and in the other a decision left up to the choice of 
the audience. Both will have highly varying results, unknown to the author — both 
are indeterminate from the author's perspective — but this perspective, the author's 
perspective, is only part of the picture. 

Which brings us to two further issues in need of our attention, as we learn to read 
processes. First, we must develop a more nuanced understanding of indeterminacy. 


Second, we must examine the similarities and differences between processes that are 
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carried out by the audience and those carried out by the author or work. 

The naive understanding of indeterminacy”s role in digital fictions is that 1t forms 
the basis of every work. As Scott Turner, author of the story generator Minstrel, 
reports, “A common question when people first hear about Minstrel is “How does it 
make up stories? Does it just make random choices?” ” (1994, 67). In fact, Turner's 
system makes random choices quite rarely — much less often than, say, Strachey's 
love letter generator — and only makes them between options it has already generated 
through a series of complex processes. 

For other processes, however, indeterminacy actually is primary. This chapter 
examines three influential text generation processes for which this is the case — 
two associated with the experimental writing and art traditions in which Queneau's 
work is situated (from Dada and the Surrealist movement) and one associated with 
the computer and information sciences tradition in which Strachey worked. The 
three display quite different approaches to indeterminacy, and close examination of 
them will help us develop our own initial set of concepts for thinking about such 
operations. It will also help develop our understanding of the historical roots of other 
work considered in this study. 

Before this, however, we will look at a model of surface, data, and process from 
Espen Aarseth's influential Cybertext (1997). Aarseth's model has provided an im- 
portant set of tools for others comparing processes — processes carried out both by 
audiences and by works — through the variables of his “traversal functions.” With 
guidance from this approach we will consider the differing versions of indeterminacy 


in Queneau and Strachey's projects, and also begin to open the complicated question 
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of the relationship between audience and work processes. 

These examinations will also form the initial steps — continued for the remainder 
of this study — toward a deeper consideration of “generating” aesthetic language and 
fictions. What are the elements of such generation work, and what approaches do 
we need to consider? Through this investigation, the notion that has so far in this 
study been called “process” will be considerably complicated. The next chapter will 
build on the work of this one in order to start developing a richer set of concepts and 
vocabulary for processes, and also further refine our understanding of indeterminate 


operations. 


Cybertext's Surface, Data, and Process 


The distinction between surface, data, and process — which was the foundation 
of the previous chapter?s discussion of Strachey?s generator — is not new. This 
distinction is one of the basic concepts presented in computer science education, for 
example. However, in computer science the terms output and interface are commonly 
used for what 1 have called “surface,” and the structures of “processes” carried out 
by computers are often referred to as algorithms.! A distinction like that between 
surface, data, and process is also known in literary studies, most commonly in the 
formulation introduced by Espen Aarseth in his book Cybertert (1997). Aarseth refers 


y) 


to surface texts as “scriptons,” textual data as “textons,” and processes for revealing 





1A computer interface, of course, is always the result of processes — it is always an output — 
but the word interface makes clear that it is also a site of audience interaction. 
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or generating surface from data as “traversal functions.” Aarseth introduces the three 


parts of his model in this paragraph: 


A text, then, is any object with the primary function to relay verbal infor- 
mation. Two observations follow from this definition: (1) a text cannot 
operate independently of some material medium, and this influences its 
behavior, and (2) a text is not equal to the information it transmits. /n- 
formation is here understood as a string of signs, which may (but does 
not have to) make sense to a given observer. It is useful to distinguish 
between strings as they appear to readers and strings as they exist in the 
text, since these may not always be the same. For want of better terms, 
I call the former seriptons and the latter tertons. Their names are not 
important, but the difference between them is. In a book such as Ray- 
mond Queneau's sonnet machine Cent mille milliards de poems, where 
the user folds lines in the book to “compose” sonnets, there are only 140 
textons, but these combine into 100,000,000,000,000 possible scriptons. In 
addition to textons and scriptons, a text consists of what 1 call a traversal 
function — the mechanism by which scriptons are revealed or generated 
from textons and presented to the user of the text. (p. 62) 


This three part model has been immensely influential, with Aarseth's terminology 
taken up by many later authors and his construction of the role of processes becoming 


the primary model introduced to many students of digital literature. 


Traversal functions 


The variables presented by Aarseth for describing traversal functions are the portion 
of Cybertezt that has most influenced the field”s thinking about processes. This para- 
eraph will briefly summarize them, drawing from Aarseth's pages 62-65. Dynamics 
describes whether the work's surface and data can change in particular ways — re- 


maining static, with only surface variability, or also variability in the number of pieces 
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of textual data in the system. Determinability describes whether the work's processes 
operate predictably in their production of textual output, or if the processes for pro- 
ducing surface texts can be influenced by unpredictable factors (e.g., randomness) 
and so yield different responses to the same audience actions. Transiency describes 
whether the work's processes cause surface texts to appear as time passes (e.g., as in 
textual animations). Perspective describes whether an audience member determines 
the strategic actions of a particular character. Access describes whether all possible 
surface texts are available to an audience member at any time (e.g., a book can be 
flipped through, so its surface texts are “random” access). Linking describes types of 
user-selectable connections that may be presented by the work's surface (such as links 
on World Wide Web pages) which may be always available, available under certain 
conditions, or simply not present. User functions are Aarseth's last variable. Every 
text makes available the “interpretive” user function to its audience. Other works 
may give the impression that an audience member is selecting a path, providing an 
“explorative” user function. If, instead, the impression is that the audience member 
is (at least in part) selecting or creating surface texts, this provides a “configurative” 
function. Finally, if textual data or text-producing processes are permanently added 
to the work by an audience member, this function is “textonic.” 

From this we can see that it is not quite accurate to say that what this study 
calls “processes” Aarseth calls “traversal functions.” Instead, traversal functions are 
only those processes that generate (and present) surface texts from data. Similarly, 
Aarseth's “scriptons” are only the textual portions of what this study would call 


“surfaces” and his “textons” are only the textual portions of what this study would 
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call “data.” So, for example, the changes in internal state and surface appearance of a 
simulated character or fictional world would be outside the scope of Aarseth's model 
— except for those state changes that could be fruitfully understood as altering how 
the work produces surface text from textual data. The same would be true of the 
numerical data used in natural language generation, the image data used in computer 


games, and so on. 


Comparing processes 


With that said, within this limited scope it is encouraging to see that Aarseth's 
Cybertext provides an example of what was sought in the previous chapter — a set 
of potentially significant features (Aarseth's “variables”) that can be used to find 
starting points for interpreting processes.? Aarseth arrived at these variables by 
careful comparison of many “cybertexts” (as Aarseth defines the term, a “cybertext 
is a machine for the production of variety of expression,” p. 3) and using his variables 
may save us significant work. We may not need to perform a set of comparisons 


ourselves (such as the previous chapter's comparison of Strachey”s love letter generator 





20One of the most important contributions of Aarseth's work to the study of digital media is 
his insistence that it is the processes that matter, rather than, say, binary data storage or CRT 
monitors. His 2004 preface to the Portuguese edition of Cybertext, written seven years after the first 
English publication, puts it this way: 


One of my main conclusions in Cybertext is that the functional variations within a 
material communication technology are often greater than between different physical 
media. 'To students of digital media, this means that there is very little they should 
presume about the medium just because it happens to be digital. In this perspective, 
the material differences of digital media (between types of computers, screen resolution, 
ergonomic design etc.) are much less significant than the immaterial (e-material?) 
differences: how the system is programmed, and what the program actually does. 
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with Queneau's One Hundred Thousand Billion Poems, Strachey's checkers-playing 
program, and Mad Libs) in order to identify the features of a work that will be 
employed in our interpretation. 

But attempting this kind of work savings does not point us toward what is pri- 
marily useful about Aarseth's system. For example, if we attempt to begin an in- 
terpretation of Strachey?s generator along these lines we find ourselves with the fol- 
lowing starting points: dynamics (intratextonic), determinability (indeterminate), 
transiency (intransient), perspective (impersonal), access (controlled access), linking 
(none), user function (interpretive). This is much less useful, as a starting point, than 
an actual description of the work's processes. 

Rather than this sort of work savings, Aarseth's variables instead provide us a 
helpful starting point for the same sort of comparisons undertaken in the previous 
chapter. For example, let us compare Strachey's generator, once again, with Que- 
neau's One Hundred Thousand Billion Poems. Tf we look at how they differ along 
Aarseth's variables, a number of interesting features emerge immediately. The val- 
ues for determinability are different, because Queneau's poem always results in the 
same text when the audience takes the same actions (“determinate”) whereas the 
randomness employed in Strachey's generator causes different texts to emerge each 
time the program is run (“indeterminate”). Similarly, the continuous availability of 
all Queneau's text makes it a “random access” work, whereas Strachey's generator 
(which keeps all unused data to itself) is “controlled access.” Further, the rearrange- 
ment performed by Queneau's audience is a “configurative” user function, whereas all 


Strachey?*s audience does is run the program and read the output — an “interpretive” 
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user function. 

These differences, identified with the help of Aarseth's variables, illuminate a 
crucial point about these two works: Strachey?s work is founded on randomness, on 
processes that operate differently each time the generator is run, whereas there is 
no randomness of this sort involved in Queneau's work. In fact, from the audience's 
perspective, there is no indeterminacy at all. By making all its possible surface texts 
available at all times, and by responding in a determinate way to audience actions, 
One Hundred Thousand Billion Poems puts all of its processes in the audience's 
hands. The author may not know what outcome will occur, in any particular reading, 
but it is known that the outcome will result from the choice of an audience member. 
This is a kind of indeterminacy, but very different from the chance or rolling a die or 


having a computer employ a random number. 


Chance in context 


We can be assured of the crucial nature of the above observation by looking at the 
larger context around these works. Alan Turing, as discussed in an earlier chapter (see 
page 28), went to the effort to construct a random number generator that employed 
noise to make truly random numbers available to users of the Mark I. In other words, 
Strachey was working in a context in which randomness was considered of interest. 
On the other hand, the Oulipo, co-founded by Queneau, has declared its opposition 
to chance. The words of Warren Motte, Jacques Bens, and Claude Berge combine 
to put this particularly well in the introduction to Oulipo: A Primer of Potential 


Literature (Motte, 1986): 
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[A]nother way of considering the Oulipian enterprise is as a sustained at- 
tack on the aleatory in literature, a crusade for the maximal motivation 
of the literary sign. All of their work, from short exercises in highly con- 
straining form to far longer texts resulting from the application of Oulip- 
ian theory, from the indications of a nostalgic longing for a mythological 
primitive language to their insistence on voluntary or conscious litera- 
ture, may be read in this light. As Jacques Bens expresses the position: 
“The members of the Oulipo have never hidden their abhorrence of the 
aleatory, of bogus fortunetellers and penny-ante lotteries: “The Oulipo is 
anti-chance,” the Oulipian Claude Berge affirmed one day with a straight 
face, which leaves no doubt about our aversion to the dice shaker. Make 
no mistake about it: potentiality is uncertain, but not a matter of chance. 
We know perfectly well everything that can happen, but we don't know 
whether it will happen.” (p. 17) 


Noting these differing stances toward randomness, of course, does not constitute 
an interpretation of either work — but might well provide a starting point for in- 
terpretations. One has the strong sense that this is the type of difference (in how 
surface is generated from data) that Aarseth's variables are designed to highlight, 
with randomness explicitly named, or nearly so, in values such as “random access” 


and “indeterminate.” 


Author and audience indeterminacy 


But there is another difference between these two works that, while raised by the 
same comparison, feels less highlighted by Aarseth's variables. It is not simply the 
case that one work has random results and the other does not. This difference in 
randomness is a direct result of the fact that Strachey?s generator carries out its own 
processes — it employs a computer to carry out processes that it defines — whereas 


the processes of Queneau's work are carried out by the audience. 
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This difference may feel less highlighted in Aarseth's work because it is a difference 
that, in some ways, the “scriptons, textons, and traversal functions” model disguises 
through its shape. In this model, traversal functions are part of the text (“[ijn ad- 
dition to textons and scriptons, a text consists of what 1 call a traversal function” ), 
and audience actions (from interpretation, through controlling a character, through 
contributing text) are part of the traversal function (the “user function” and “per- 
spective” traversal function variables, in the cases just mentioned). There may be 
less visibility for the difference between audience processes and work processes when 
both are positioned as part of the text. 

More generally, Cybertext works to undermine any special claims for “the digi- 
tal” — and all claims which may give too much focus to the specifics of particular 
media forms — because they distract our attention from the important issue: how 
systems function. This argument begins in the very first paragraph of Cybertezt, 
in which Aarseth states that “the concept of cybertext does not limit itself to the 
study of computer-driven (or “electronic”) textuality; that would be an arbitrary and 
unhistorical limitation, perhaps comparable to a study of literature that would only 
acknowledge texts in paper-printed form.” 

This helps explain another feature of Cybertext's “traversal functions” — the 
fact that aspects like determinability are described using the same variables whether 
the processes are carried out by the audience or a computer. Á random number 
generated by a computer is considered the same as that generated by an audience 
member following instructions embedded within the work that direct her to roll a die 


and do something with the resulting number. From some perspectives (those which 
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focus on the fact of random number generation) this is perfectly appropriate. The 
question we must consider is whether it is appropriate for understanding aesthetic 


experiences. 


Audience and Work Processes 


Lev Manovich, in his essay “New Media from Borges to HTML” (2003), addresses the 
fact that computers now carry out processes that could, conceivably, be carried out 
by humans (or already have been). For example, the sorts of combinatorial processes 
employed in Strachey's generator and Queneau's Poems could be carried out by a 
computer, following process definitions embedded in the work, or by human audience 
members. The question is whether the difference, when the processes are carried out 


by the audience or the work, is significant. Manovich writes: 


Consider, for instance, the computer's ability to represent objects in linear 
perspective and to animate such representations. When you move your 
character through the world in a first person shooter computer game (such 
as (Quake), or when you move your viewpoint around a 3D architectural 
model, a computer recalculates perspectival views for all the objects in the 
frame many times every second (in the case of current desktop hardware, 
frame rates of 80 frames a second are not uncommon). But we should 
remember that the algorithm itself was codified during the Renaissance 
in Italy, and that, before digital computers came along (that is, for about 
five hundred years) it was executed by human draftsmen. Similarly, be- 
hind many other new media techniques there is an algorithm that, before 
computing, was executed manually.... 


Substantially speeding up the execution of an algorithm by implementing 
this algorithm in software does not just leave things as they are. The 
basic point of dialectics is that a substantial change in quantity (i.e., in 
speed of execution in this case) leads to the emergence of qualitatively 
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new phenomena. The example of automation of linear perspective is a 
case in point. Dramatically speeding up the execution of a perspectival 
algorithm makes possible previously non-existent representational tech- 
nique: smooth movement through a perspectival space. In other words, 
we get not only quickly produced perspectival drawings but also computer- 
generated movies and interactive computer graphics. (p. 21) 


Certainly none of us would disagree with Manovich's basic point. There is a 
substantial difference between a computer animating our walk down a hallway and a 
human, sitting across from us, drawing each frame of the animation by hand and then 
showing it to us as a flipbook. The difference would still remain if the human were 
drawing each frame on a computer and showing it to us on a computer screen. The 
difference would be further accentuated if we had to produce the drawings ourselves, 
carrying out instructions provided by the work. The difference is in the computer 
carrying out the process of animation, which creates for the audience an experience 


of interactive navigation through virtual space. 


Process-intensive literature 


Something very similar to Manovich's point about Quake can be said of Strachey's 
generator. The key fact about it is not that it existed in the memory of a digital 
computer.? The key fact is that the computer, as Crawford argues, is a machine for 
carrying out processes — and that the generator employed it in this manner. In fact, it 
did so to an extent that allows us to identify it as not only the first experiment with 


digital literature, but also with process-intensive literature. Queneau's Poems, by 





3 Just as an emailed Virginia Woolf story is not digital literature. 
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presenting a “one of ten” choice to the reader only once for every line of a sonnet, can 
be read relatively fluidly without a computer. But Strachey's generator, as discussed 
earlier, involves a much higher degree of combinatorial choice — with a choice among 
many options needed for nearly every word. It is a literary work that can only be 
read fluidly if its processes have been carried out first (the generator would be nearly 
unreadable if it used the same structure as Queneau's Poems) but because those 
processes are carried out by a computer they can be executed anew for each reading. 

This sort of difference is only growing in importance. Most process-oriented works 
of digital literature use algorithms of a complexity that dwarfs that of those used in 
Strachey's generator — algorithms even further from those that could be carried out 
by a human reader at the time of reading. Further, many digital fictions now interact 
with the audience while the work is being experienced — a feature that enables most 
computer games, for example — and so use their ability to carry out processes in 
a manner quite similar to Manovich's example of Quake: to create an experience of 
response utterly different from what we would experience if carrying out the same 


processes ourselves (or having them performed by another human sitting with us). 


Hidden-process literature 


Jesper Juul, in his Half-Real: Video Games between Real Rules and Fictional Worlds 
(2005), specifically outlines some of the important ways that video games are different 
from non-computer games, even when they involve similar processes. These are his 


first two: 
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1. Rules: While video games are just as rule-based as other games, they 
modify the classic game model in that it is now the computer that upholds 
the rules. This gives video games much flexibility, allowing for rules much 
more complex than humans can handle; freeing the player(s) from having 
to enforce the rules; and allowing for games where the player does not 
know the rules from the outset. 


2. Variable outcome: In many cases, the computer can act as a referee 
in order to determine the outcome of a game on the basis of events that 
would not be immediately discernible to a human. (p. 53-54) 


Translating one of Juul's points to the realm of digital fiction highlights another 
clear consequence of the computer carrying out a work's processes: the processes are 
not necessarily revealed to anyone other than the work's author(s).* On the other 
hand, processes carried out by the audience require specific instruction and direct au- 
dience effort. This distinction can be observed, again, in the different interpretations 
of Strachey's generator and Queneau's Poems. No interpretation of Queneau's work 
fails to mention its processes, and the structure of its processes is often the feature 
of the work most remembered by readers. On the other hand, no previous writing 
about Strachey's generator (except that by Strachey himself) has, to my knowledge, 


even indicated understanding of how its combinatorial processes operated. 


The need for interaction 


In short, the difference between audience and work processes is crucial. We will 
confuse ourselves both about audience experiences and about the form of works if we 


operate with a set of concepts that makes it difficult to distinguish in this way. It is 





%As discussed in the first chapter's section on access to processes, beginning on page 35. 


118 


for this reason that this study uses the concept of interaction to discuss changes to 
the state of a work that come from outside the work (including from the audience). 

To put it another way, a three-part model — whether called “surface, data, and 
process” or “scriptons, textons, and traversal functions” — is insufficient for dis- 
cussing digital literature or other forms of digital media. We must have the tools 
for an analysis that identifies not only what happens — a “process” or “traversal 
fuction” — but also how it happens. We must be able to take different approaches 
to automatic processes and audience interactions, while retaining our ability to com- 
pare them. We need at least a four-part model (and probably one as large as that 


5 However, as indicated at the outset of this study, 


presented in the first chapter). 
here we will background such concerns and largely focus on processes that accomplish 


their work with little audience interaction. 


Three Indeterminate Processes 


Poet and digital literature theorist Charles O. Hartman, writing in Virtual Muse: Ex- 
periments in Computer Poetry (1996), devotes most of a chapter to thinking through 
the relationship of chance, of randomness, to digital poetry. He begins his third 
chapter by reminding us that “One of the Greek oracles, the sibyl at Cumae, used 


to write the separate words of her prophecies on leaves and then fling them out of 





5 Alternately, of course, we could introduce another model to operate in parallel with our first. 
Cybertext in part addresses the issue of audience processes by having another, parallel model. This 
model, as outlined on Cybertext's page 21, also consists of three parts: “operator, verbal sign, 
and medium.” Unfortunately, this model does not receive the elaboration given to the “scriptons, 
textons, and traversal functions” model. 
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the mouth of her cave. It was up to the supplicants to gather the leaves and make 
what order they could” (p. 29). He compares this with his early poetic experiment 
for the Sinclair ZX81, a BASIC program called RanLines that stored 20 lines in an 
internal array and then retrieved one randomly each time the user pressed a key. 
Hartman then declares that this “simple sort of randomness” has “always been the 
main contribution that computers have made to the writing of poetry” (p. 30). 

Hartman goes on to offer a number of practitioner”s insights and general obser- 
vations about approaches to working with chance operations in poetry. In the next 
chapter we will look at some of Hartman's work, and see that a “simple sort of ran- 
domness” may not be an entirely fitting phrase for the computational processes he 
employs. But here, instead, we will pause for a closer look at operations with in- 
determinate outcomes. What does it mean to employ such operations in generating 
texts? How do we understand, and interpret, the operations of processes for which 
“chance” is said to be primary? 

In order to consider these questions we will first begin with the context and 
specifics of three processes often cited in discussions of indeterminate operations and 
creative work. Two are from the experimental writing tradition that includes Que- 
neau's work: Tristan Tzara's newspaper poem (connected with Dada) and Soupault 
and Breton's automatic writing (connected with Surrealism). The last is from the 
information and computer science tradition that includes Strachey's work: Claude 
Shannon's n-grams (proposed in a foundational paper for the mathematics of com- 


munication). 
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Dada 


Let us start with the definition of a famous poetry generation process: 


To make a dadaist poem 

Take a newspaper. 

Take a pair of scissors. 

Choose an article as long as you are planning to make your poem. 
Cut out the article. 


Then cut out each of the words that make up this article and put them 
in a bag. 


Shake it gently. 


Then take out the scraps one after the other in the order in which they 
left the bag. 


Copy conscientiously. 
The poem will be like you. 


And here are you a writer, infinitely original and endowed with a sensi- 
bility that is charming though beyond the understanding of the vulgar. 

This quotation, from Tristan Tzara's 1920 Manifesto on feeble love and bitter love 
(Motherwell, 1981, 92) is one of the most commonly reprinted texts from the Dada 
movement. However, the words that immediately follow those above, in the origi- 
nal text, are a poem composed by this method — which is almost never reprinted.* 
Rather, the method itself is discussed as an expression of what is taken as the move- 
ment's defining characteristic: nihilism. 

In the standard story about Dada, it was a nihilistic, anti-art, anti-bourgeouise art 


movement that began in Zurich in 1916. It was a response to World War I — a tragic 





SDespite the evocative title, “when the dogs cross the air in a diamond like the ideas and the 
appendix of the meninges shows the hour of awakening program (the title is my own)”. 
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display of new technoscientific weaponry (chlorine gas, mustard gas, artillery, machine 
guns) and “rational” trench warfare techniques that left most of a European male 
generation mutilated or dead. The group coalesced around the Cabaret Voltaire, 
a literary/artistic nightclub organized by German group member Hugo Ball, and 
included Emmy Hennings, Richard Huelsenbeck, and Hans Richter from Germany; 
Tristan Tzara and Marcel Janco from Romania; Hans Arp from Alsace; and Sophie 
Táuber from Switzerland. After the war it spread from neutral Switzerland to Paris 
and Berlin. It was not so much an art movement as an attitude. As Dawn Ades 
puts it in the Thames and Hudson guide to Dada and Surrealism (1974), Dada “was 
essentially a state of mind, focused by the war from discontent into disgust” (p. 12) 
and members of the movement believed that the artist was “an anachronism whose 


work was totally irrelevant, and the Dadaists wanted to prove its irrelevance in public” 


(p. 4). 


Understanding an engaged Dada 


The standard story outlined above is now being seriously questioned by those with an 
interest in Dada. Leah Dickerman argues in the introduction to The Dada Seminars 
(2005, 2), “[T]he persistent and generalized characterization of Dada as an “attitude” 
rather than a coherent, if novel, approach to art making has worked to deflect fur- 
ther definition of the logic of the movement's formal procedures and the particular 
social semiotics of its objects.” As this questioning takes place, the simple explana- 
tion of “nihilism” for all the movement's works is being replaced by a more careful 


examination of what Dada worked to create and dispose, especially in relation to 
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the war. Ball had volunteered for German military service three times, but always 
been declared medically unfit. Following this, he visited the front lines of World War 
One in 1914 and was so horrified that he returned a pacifist. As T.J. Demos puts 
it (2005, 8), this trip to the war's front “brought about a crisis that catalyzed his 
expatriatism and nearly ended in suicide.” Ball was involved in anti-war activities 
in Berlin before emigrating to Zurich with Hennings (who was, at that moment, “re- 
cently released from imprisonment for forging passports for those who wished to avoid 
military service” (Ball, 1974, xviii)). In Switzerland Ball contemplated taking his own 
life but, instead, returned to the same lake where he had struggled with suicide and 
dumped German war paraphernalia into the water, rather than himself, and founded 
the Cabaret shortly after (Demos, 2005). Huelsenbeck returned to Berlin in 1917, 
during the war, and catalyzed an angrily anti-war branch of the Dada movement 
there. Jonathan Jones writes that it is “unsentimental fury that makes Berlin Dada 


some of the greatest art of the 20th century” (2005). 


Dada processes 


Our view of Dada is evolving. We no longer see it as a movement driven only by 
nihilism and self-proclaimed irrelevance, but also as one that encompasses unsenti- 
mental anti-war fury. We no longer see it as small clique centered in Zurich, moving 
to Berlin and Paris only when the war was safely over, but as an international move- 
ment almost from the start (especially when one considers New York Dada). As this 
happens, some have argued that it is Dada processes, Dada strategies, that are cen- 


tral to understanding it as a movement and its impact on later work. Turning again 
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to Dickerman, she writes (2003, 9): 


[Ejooking at Dada, separately from Surrealism, across the work of various 
artists and cities of production, makes clear the degree to which it coheres 
through the insistent pursuit and reworking of a set of strategies — col- 
lage, montage, the readymade, chance, and other forms of automatization 
— so foundational for the rest of the century that we have to struggle to 
recognize their historical novelty today. The portability of these strate- 
gies, the way in which they travel between artists and city centers and are 
reinforced and developed, speaks to Dada's crucible effect, the extremism 
of its group dynamics. Dada's cohesion around such procedures points 
to its primary revolution — the reconceptualization of practice as a form 
of tactics. That last word, with all of its military connotations, is help- 
ful: not only does it suggest a form of historical mimicry, a movement 
from battlefield to cultural sphere, but also it helps to define a mode of 
intervention within a predetermined field, intended as assault. 

To meet this challenge, we will need to learn to read Dada's processes. And 
not only well-known processes such as Tzara's method of making poetry, randomly, 
from postwar newspaper accounts (proposed under the sign of “feeble love and bitter 
love”). We also need to read less celebrated examples such as his “static poems,” 
presented in Zurich during the war, that “consisted of chairs on which placards were 
placed with a word written on each, and the sequence was altered each time the 
curtain was lowered” (Motherwell, 1981, 132). 

But do we also need to read Dada's surface and data? Perhaps not. As men- 
tioned previously, while the instructions for Tzara's newspaper poem process are 
widely reprinted, the surface text example that followed is not. And while Robert 
Motherwell's The Dada Painters and Poets describes Tzara's static poems, it pro- 


vides no examples of them. It may be that, while many specific Dada works retain 


their power to reach us (from Ball's sound poems to Otto Dix's war-focused collages), 
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Dada's most significant area of artistic innovation was in the creation of processes. It 
may be these processes — and the way that they call the infinitely original sensibility 
of the Goethean artist into question, along with the logics of nationalism, rationalism, 
and property to which it is connected — are the work to which we should look. Per- 
haps rather than read individual works, to understand Dada we should read processes 
as employed across many works, or even independently of any examples of work at 
all. 

This is a question we will return to, in a different form, with regard to the Oulipo's 
concept of “potential literature.” In the mean time, let us hold off on concluding 
that data and surface are of secondary importance in Dada work, and particularly 
in the example of Tzara's poetry generation. Let us first consider the processes of 


“automatic writing” and “n-grams.” 


Surrealism 


The Dada and Surrealist movements are often discussed together. In part this is 
because they were historically connected — one following the other, and with shared 
locations and members. But it is also, in part, because of a perceived resonance in a 
number of their of strategies, a number of their processes. 

The history of Surrealism is generally traced back to Paris in 1919, when the jour- 
nal Littérature was founded by André Breton, Louis Aragon, and Philippe Soupault, 
joined shortly afterward by Paul Eluard. Littérature was an experimental journal and 
friendly toward Dada, including reviews of Tzara's work, and then texts of Tzara's, 


in early issues. The group around the journal looked forward to Tzara's January 1920 
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arrival in Paris and it was shortly thereafter, at a Littérature-sponsored event, that 
Tzara's cut-up poem was famously produced — by cutting up an article by Léone 
Daudet and reading the results while Breton and Aragon rang bells loudly. Ruth 
Brandon, in her Surreal Lives (1999), quotes Aragon: “For once, expectation was 
fulfilled ... It was three minutes of indescribable hullabaloo, and the audience, which 
had never experienced anything so insolent [at that time it was disrespectful even to 
read poems sitting down], was furious. They yelled and threatened” (p. 139). 

On some level, this was a great success — for a poetic process to make such an 
impression on an audience less than two years away from the daily horrors of war. But 
already Littérature had begun to publish texts produced by a different process, one 
which bore the mark of the group's (and, particularly, emerging leader Breton's) very 
different outlook from Dada's. The October, November, and December 1919 issues 
of the journal presented the first three chapters of Magnetic Fields (Les Champs 
Magnétiques) Soupault and Breton's famous work of “automatic writing” (Brandon, 


1999, 160). 


Automatic writing 


Automatic writing seeks to produce text by a means different from conscious thought, 
but also not through the seemingly unguided randomness of Tzara's newspaper poems 
and similar techniques. It seeks writing, instead, from unconscious thought. The 
instructions for automatic writing in Surrealist Games (Brotchie and Gooding, 1991, 


17) read: 


Automatic writing is the most direct of Surrealist techniques. 
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Sit at a table with pen and paper, put yourself in a “receptive” frame of 
mind, and start writing. Continue writing without thinking about what is 
appearing beneath your pen. Write as fast as you can. If, for some reason, 
the flow stops, leave a space and immediately begin again by writing down 
the first letter of the next sentence. Choose this letter at random before 
you begin, for instance a “t,” and always begin this new sentence with 
a 


This has generally been considered a chance process, and discussed along with 
Tzara's newspaper poem process. The fit, of course, is not exact. The source of 
randomness in automatic writing is not terribly clear. But even those who have 
considered this have often reached conclusions such as George Brecht's in Chance 
Imagery (1966) that “it is practical to consider chance as being defined by consciously 


unknown causes, and by this definition, at least, automatism is a chance process.”” 





"Soupault and Breton practiced the automatic writing technique together, passing the 
manuscript back and forth, each attempting to continue the text with unconscious thought. Breton 
attributed the inspiration for this to Freud's ideas. As he would later write in his first Manifesto 
of Surrealism (1924), “Completely occupied as Í still was with Freud at that time, and familiar as 
I was with his methods of examination which 1 had some slight occasion to use on some patients 
during the war, 1 resolved to obtain from myself what we were trying to obtain from them, namely, 
a monologue spoken as rapidly as possible without any intervention on the part of the critical facul- 
ties, a monologue consequently unencumbered by the slightest inhibition and which was, as closely 
as possible, akin to spoken thought” (1969, p22-23). But while Freud's techniques were certainly 
employed in treating French victims of shell shock at the time discussed by Breton, Freud's writings 
were not yet translated, and according to Soupault the ideas that animated Magnetic Fields were 
derived at least as much from Pierre Janet (Brandon, 1999, 157). Janet had been publishing about 
automatic writing and its relation to the unconscious since the 1880s, in French, starting with an 
article discussing his patient Lucie. Janet and Lucie had been practicing relatively standard hyp- 
nosis techniques, but eventually “discovered an experimental counterpart to the automatic writing 
of the spiritualists, writing governed by a mysterious and hidden self that existed independently 
of normal consciousness, a “second state' as Janet would term it, that was produced “by a certain 
doubling [dédoublement] of consciousness” ” (Rainey, 1998). The spiritualists were also an influence 
on Surrealism. The Surrealists dubbed medium Hélen Smith “The Muse of Automatic Writing” 
(Brotchie and Gooding, 1991, 141) and for a period adopted the practice of putting members into 
medium-like trances. 


12 


Surrealism's trajectory 


The split of the Littérature group with Dada is often attributed to their “more pos- 
itive” goal of creating literature through processes that open the exploration of the 
unconscious, epitomized by automatic writing — although personality conflict was 
certainly another major factor. The split had clearly already begun in 1922, and 
was cemented on July 6th, 1923 when Breton mounted the stage during an evening 
organized by Tzara. Breton ordered off the stage Pierre de Massot, who was then 
performing a disturbing text elegizing living people (e.s., “Francis Picabia, died in 
battle / Pablo Picasso, died in battle”) some of whom were in the audience. When 
de Massot mumbled in reply Breton struck him with his cane so hard as to break de 
Massot's left arm (Brandon, 1999, 168). 

The first Manifesto of Surrealism, with its focus on freeing the imagination, au- 
tomism, the unconscious, the arbitrary, disordering the senses, and removal from 
everyday language (and with a considerably less confrontational attitude) was pub- 
lished the next year. With this the group had a name and an identity. Over the 
following years international politics would become increasingly important to the 
Surrealists, and in time it was positioned as operating under the dual sign of Freud 
and Marx — seeking both internal and external liberation. 

The practices of automism at first seemed so central to some members, and the 
possibility of a true visual analogue of automatic writing so unlikely, that some con- 
cluded there could be no truly Surrealist visual art (Montagu, 2002). But Breton and 
others held the opposing view, and visual artists became increasingly important in 


the movement. Over the years many writers and artists joined and left the group, 
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with those leaving doing so both of their own accord and via the excommunications 
of charismatic and controlling Breton. Writers and artists associated with Surrealism 
include: René Magritte, Georges Bataille, Joan Miró, Max Ernst, Pierre Naville, Ben- 
jamin Perét, Antonin Artaud, René Crevel, Salvador Dalí, André Masson, Alberto 
Giacometti, Man Ray, Yves Tanguy, and a number of others — such as Raymond 


Queneau, author of One Hundred Thousand Billion Poems. 


Surrealism's processes 


Automatic writing was not the only generative process explored by the Surrealists. 
They invented and adapted (from sources such as surveys and parlor games) many 
group and individual processes that sought to open access to the irrational and the 
evocative juxtapositions of “objective chance.” One of the most famous of these is the 
“exquisite corpse,” a group process first practiced around 1924-25. The process can 
be performed with text or imagery. In each case, a paper is folded (e.g., into horizontal 
bands) and the participants take turns, each doing their work on one of the sections 
and then turning a fold so that their contribution is not visible to the participants that 
follow. In the textual version a sentence structure is selected that determines each 
participant's contribution. The example given in Surrealist Games is: player one, 
definite or indefinite article and an adjective; next player, noun; next player, verb; 
next player, another definite or indefinite article and adjective; next player, another 
noun. The process is said to be named for the first sentence generated in this fashion: 
“The exquisite corpse shall drink the new wine.” More complex textual structures, 


and those that accommodate longer individual contributions, are also used. The 
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visual version of exquisite corpse, on the other hand, has significantly less structure 
— though it is common for the participants to agree that the image will portray some 
sort of creature. In creating a visual exquisite corpse participants draw in one section 
of the paper, and then allow the lines of their drawing to continue for a short distance 
beyond the fold. The participant who draws in the next section chooses to continue 
these lines in some way, and then continue her own lines beyond the next fold, and 
so on until the paper ends. 

Surrealism was, in essence, defined in opposition to Dada. And yet it is interesting 
to note that the exquisite corpse technique, in its visual form, was being practiced by 
Otto Dix and Christoph Voll (while Dix was associated with Dada) 2-3 years before 
it was independently discovered by the Surrealists (Brotchie and Gooding, 1991, 153). 
The textual form is not only described by Surrealists, but also by Tzara (ibid, 144). 
And while William S. Burroughs famously complained that Breton cut off Tzara's 
experimentation and “grounded the cut-ups on the Freudian couch” (1978), in fact 
it could be argued that Breton continued the work of Tzara's newspaper poems — 
including in the first Manifesto of Surrealism a text composed of “headlines and 
scraps of headlines cut out of the newspapers” and entitled “POEM.” 

But was Breton continuing Tzara's work? One has the sense, looking at Breton's 
newspaper text, that the scraps were selected carefully and the whole arranged artfully 
(even if, perhaps, “unconsciously”). He may say, in this part of the Manifesto, that “It 
is even permissible to entitle POEM what we get from the most random assemblage 


possible” — but this is not the most random assemblage possible, and not even 
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8 However, it may 


the most random recently practiced with Breton's participation. 
be the most random assemblage that, through the contribution of an author, creates 
evocative juxtapositions of language. Unlike the texts produced by Tzara's technique, 
Surrealist texts are commonly reprinted (even without accompanying descriptions of 
their processes). This may indicate that they are more successful on a literary level, 
but it also indicates that they do less to disorder our normal sense of the literary 
(and authorship). That is to say, each group seems to succeed at its own, quite 
different, goal — even while operating via processes that can be described with similar 
language. Both, for example, emphasize indeterminacy and irrationality, and do so 
in a way that influenced future work both in experimental literature and the digital 
arts. Given the importance of “chance” processes, of the aleatory, for both groups, 


we find ourselves returning to the question: How do we interpret indeterminacy in 


generative processes? 


Shannon's n-grams 


Interpreting indeterminacy will require that we develop a clearer understanding of 
what seemingly random processes do. Or, to put it another way, we need to situate 
the indeterminate elements of generative processes within a larger view of what the 
processes perform with their data. To begin this work we will look at a process that 
is connected with traditions of computer science, rather than experimental literature 


and art. 





Remember that Breton was a bell ringer during Tzara's famous performance. 
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This process comes from Claude Shannon, sometimes referred to as the “Newton 
of the Information Age.” Shannon's nickname comes from the fact that he made ma- 
jor contributions to formulating the mathematics of communication — what he called 
“the fundamental problem of ... reproducing at one point either exactly or approxi- 
mately a message selected at another point” (Shannon, 1948). This quotation comes 
from a paper that was a milestone in the field, both when published on its own (as “A 
Mathematical Theory of Communication”) and when repackaged as a book with an 
introduction by Warren Weaver (as The Mathematical Theory of Communication). 
The portion of this work that particularly interests us is Shannon's description of 


stochastic approximations of English. 


Random messages 


Shannon provides six sample messages in his description. In the first, each of the 


alphabet's 26 letters and the space appear with equal probability: 


XFOML RXKHRJFFJUJ ZLPWCFWKCYJ FFJEYVKCOSGHYD QPAAMK- 
BZAACIBZLHJQD. 
In the next, the symbols appear with frequencies weighted by how commonly they 


appear in English text (i.e., “E” is more likely than “W”): 


OCRO HLI RGWR NMIELWIS EU LL NBNESEBYA TH EEI ALHEN- 
HTTPA OOBTTVA NAH BRL. 
These messages were generated using two processes with random elements. The 
first was generated using a book of random numbers, which determined the letter (or 


space) that would be recorded next. The next message also used the book of random 
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numbers, but in addition employed a table of letter frequencies, so that letters used 
more often in English would appear more often in the output. 

That is to say, the same random process generated different results because it 
was operating on different data sets. For the first, the data was simply the alpha- 
bet and the space considered equally. For the second, the data determined greater 


representation, in the surface results, of the letters more commonly used in English. 


N-grams 


Shannon's remaining four sample messages were produced with a somewhat different 
process. Let us take a look at them. In the third message, symbols appear based 
on the frequencies with which sets of two of the symbols appear in English. That 
is to say, after one letter is recorded, the next is chosen in a manner weighted by 
how commonly different letters follow the just-recorded letter. So, for example, in 
generating the previous message it would have been important that “E” is a more 
common letter than “U” in English. However, in making this third message, it 
becomes important that, if there is a pair of letters in English that begins with “Q” 
it is much more likely that the complete pair will be “QU” than “QE.”” Taking 


the frequencies of pairs into account in this manner means paying attention to the 





%Until the Middle East war begun by the U.S.'s first Bush administration, the frequency of any 
character other than “U” following “Q” in U.S. English was very low. Since that war, the increased 
usage of words such as “Iraq” has changed the language's statistics, making the word space character 
much more likely to follow a “Q” than it was before. With the September 11th terrorist attacks 
on the U.S., the language's statistics changed again, with “al-Qaeda” making “QA” a much more 
common pair than it was previously. 
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frequencies of “digrams.”*% The sample message created in this way is: 


ON IE ANTSOUTINYS ARE T INCTORE ST BE S DEAMY ACHIN D 
ILONASIVE TUCOOWE AT TEASONARE FUSO TIZIN ANDY TOBE 
SEACE CTISBE. 


In the fourth, symbols appear based on the frequencies with which sets of three 
of the symbols appear in English. This is called a “trigram” — with each choice of 
the next letter being weighted by the frequencies with which various letters follow 


the set of two just recorded. This sample message is: 


IN NO IST LAT WHEY CRATICT FROURE BIRS GROCID PONDE- 
NOME OF DEMONSTURES OF THE REPTAGIN IS REGOACTIONA 
OF CRE. 


In the fifth, the unit is moved from letters to words. In this message, words 
appear in a manner weighted by their frequency in English, but without attention to 


the prior word: 


REPRESENTING AND SPEEDILY IS AN GOOD APT OR COME 
CAN DIFFERENT NATURAL HERE HE THE A IN CAME THE TO 
OF TO EXPERT GRAY COME TO FURNISHES THE LINE MES- 
SAGE HAD BE THESE. 


Finally, in the sixth sample message, words are chosen based on the frequency with 
which pairs of words appear in English. This, again, like the technique of choosing 


based on pairs of letters, is called a “digram” technique. The final message is: 





10Shannon uses the term “digrams,” though later usage in the Natural Language Processing 
community has at times preferred “bigrams.” 
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THE HEAD AND IN FRONTAL ATTACK ON AN ENGLISH WRITER 
THAT THE CHARACTER OF THIS POINT IS THEREFORE AN- 
OTHER METHOD FOR THE LETTERS THAT THE TIME OF WHO 
EVER TOLD THE PROBLEM FOR AN UNEXPECTED. 


Randomness and data 


One interesting thing about the last four samples is that they were constructed using 


ordinary books. Shannon explains the process as follows: 


To construct (3) for example, one opens a book at random and selects a 
letter at random on the page. This letter is recorded. The book is then 
opened to another page and one reads until this letter is encountered. 
The succeeding letter is then recorded. Turning to another page this 
second letter is searched for and the succeeding letter recorded, etc. A 
similar process was used for (4), (5) and (6). It would be interesting 
if further approximations could be constructed, but the labor involved 
becomes enormous at the next stage. 


That is to say that the last sample message (which begins with a sequence that 
sounds surprisingly coherent) was generated by opening a book to a random page, 
writing down a random word, opening the book again, reading until the just-recorded 
word was found, writing down the following word, opening the book again, reading 
until that second word is found, writing down the following word, and so on. 

Why does this work? Why is using a regular book, rather than elaborate tables 
of letter or word frequencies, an acceptable way for Shannon to construct his sample 
messages? The answer is that Shannon is operating on the (basically supportable) 


assumption that ordinary books appropriately reflect the frequencies of letters and 
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words in English.** And this, in turn, is why there is a passage of surprising coherence 
in the last sample message. Because choosing an ordinary book is actually choosing 
a piece of highly-shaped textual data (shaped, for example, by the frequencies of 
words and chains of words in the book's language, the topic of the book, the book's 
presumed audience, etc.) and generation processes with random elements do not 
necessarily produce random surface texts. Rather, random processes produce results 


that reveal information about the data on which they act. 


Comparing the processes 


With the insight that random processes reveal information about the data they em- 
ploy, let us return to Dada and Surrealism. Tzara's process — of cutting up a text 
into individual words and then composing a poem by drawing them randomly from 
a sack — certainly has a random element to it. But the result is not random. It 
produces a text very much shaped by its data: a particular postwar newspaper arti- 
cle, its words, and its distributions of different types of words. A different selection 
of data would produce very different surface results. And while not much focus has 
been given to Shannon's selection of data for his sample messages, the same could 
certainly be said there. While many “ordinary” books might allow us to generate 
a surface text including a “frontal attack on an English writer,” many others would 
not. 


Breton and Soupault's process of automatic writing, on the other hand, while 





1 Just as public opinion polls operate on the assumption that a random sample of the population 
accurately reflects the beliefs of the larger nation. 
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often discussed in terms of chance, is revealed as a process with no random element. 
In the terms used here, it is simply a transcription of data. It may be data to 
which we do not feel we normally have access (what we might call our “stream of 
subconsciousness” ), or that feels somehow random in its organization, but that does 
not make the process random. Nor is the data. Quite the opposite, its shape is 
an insight into its source — the interior life that the Surrealists sought to explore. 
Similarly, while it is common to say of other Surrealist processes that their point is to 
“invoke chance, like Cadavre Exquis” (Ades, 1974, 57), the “chance” of the exquisite 
corpse is quite different from the random chance of Tzara or Shannon's processes. 

To put it another way, processes in which indeterminacy is a primary element are 
ways of providing a window on their data — and must be interpreted in terms of the 
type of view they provide, the specific data employed, and the context from which 
their data emerges and in which the audience will experience the resulting surface. 
As we have seen, these three archetypical “chance” processes differ vastly along these 
lines. 

Automatic writing is a transcription of data, presumably drawn from the mystical 
unconscious of the writer(s), and emerged from writers seeking an inward-looking 
alternative to the radical Dada. Newspaper poems give us each piece of the source 
data, once, in an order presumably determined in part by the ease with which paper 
scraps (words) of different lengths are grasped blindly in a sack. This draws audience 
attention to the words (their subject, tone, distribution, etc.) and the context of 
their performance (a post-war audience expecting poetic removal from the everyday 


and, instead, having attention turned, word by word, to the language of a particular 
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newspaper text). N-grams cast light on the lines of order that run through the data, 
the very lines cut by the newspaper poem process. But, as we will see later in this 
study, in allowing short subsets of these lines to recombine, text processes constructed 
along n-gram lines can both strengthen attempts to recognize traditional utterances 
and produce powerfully disorienting language. 

Of course, Shannon was not working in a literary or artistic context — but his work 
provided a process later used by many for creative work. How we can think about 
the creation of (potentially) literary processes, apart from any particular examples of 


work created with them, will also be one of the topics of the next chapter. 


Five Processes, Divided Three Ways 


In this chapter we have looked at the operations of five processes — two familiar 
(Strachey and Queneau's) and three newly introduced. Over the last century, each 
of these processes has been used as a way of generating text, and in recent years they 
have been discussed in similar terms. But the operations of choice and chance diverge 
significantly for the different members of this group. 

In one case, Queneau's, the processes that produce different texts are operated 
by the audience. The outcomes are determined by audience choice. This creates an 
experience quite different from the others, which simply present the audience with 
completed text. In the case of Queneau's work, the audience has to understand and 
carry out the processes. This becomes one of the most memorable things about the 


work. In the other cases, the audience may not even know that a process was involved 
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in producing the text. 

On the other hand, we have seen that this audience perspective is not the only 
fruitful way of viewing works such as these. Taking a critic?s perspective, the com- 
parison of Queneau's process with Strachey?s was quite useful to us in the previous 
chapter. And in this chapter we have seen how comparisons of digital and non-digital 
processes have formed the core of Espen Aarseth's influential critical work. 

For the purposes of this project — which seeks to enjoy the benefits of both com- 
paring and contrasting processes of these these two sorts — the term “interaction” 
serves as an important (if temporarily backgrounded) compliment to the terms “sur- 
face, data, and process.” In the remainder of this study, “process” will be used to 
refer to a work or author's processes, rather than those carried out by the audience, 
unless an exception is explicitly noted. 

The concept of interaction, however, also played a different role in this chapter. 
From the perspective of the audience the processes of Strachey, Tzara, Soupault and 
Breton, and Shannon may look essentially the same — though three are carried out by 
the author and one by a digital computer. The essentially similar appearance comes 
from the fact that they all take place at some time before the audience experiences 
the resulting surface. But in this chapter, with the help of Lev Manovich and Jesper 
Juul, we saw one important difference between processes carried out by the work and 
those carried out by the author. Processes carried out by the work, if they can be 
made to operate fast enough, can be carried out at the time of audience experience 
and enable interaction — such as the dynamic animation and rules enforcement of 


video games. 
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This, then, gives us three categories of process: those carried out by the audience, 
those carried out by the author, and those carried out by process defined within the 
work itself and automatically executed by the computer on which it runs. In terms 
of interaction, they occupy three different positions. The first is a potential result of 
interaction, the second has no role in interaction, and the third is a potential support 
to interaction (both in responding to audience interaction and creating the surface as 
a site for interpretation and interaction). 

In terms of indeterminacy, the five processes also can be seen to fall into three 
groups, though different ones. Strachey's, Tzara's, and Shannon's each employ what I 
refer to as “randomness” (and, as we will see in the next chapter, Florian Cramer calls 
“stochastic chance”) — though within processes that are, otherwise, structured sig- 
nificantly differently. This is the same sort of chance that we find in the roll of a die or 
the flip of a coin. Soupault and Breton's process, on the other hand, attempts to find 
indeterminacy of the sort Florian Cramer calls “philosophical-ontological chance.” 
This is the chance that anything might happen, not just one of a number of de- 
fined possibilities. Finally, Queneau's process employs the indeterminacy of audience 
choice, an approach that will receive continued attention throughout this study, in- 
cluding in the next chapter. 

Despite these differences, however, we have also seen in this chapter that it is 
possible to make a generalization about the class of text generation processes for 
which chance is primary: they don't all give the same results, but provide a particular 
window onto their data. The shape and context of this window provides a starting 


point for our understanding of such processes — and also helps us understand that it 
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means almost nothing to say that a work is “simply random.” We must ask: random 


in what way, with what data, and for what audience? 


Implementations, Logics, and Potential 


Understanding Generation 


Literary generators — poetry generators, story generators — are designed to produce 
many possible literary configurations from one work. 

This study examines a variety of generation projects: the art-oriented undertak- 
ings of Dada, Surrealism, and the Oulipo; the functionally-oriented explorations of 
computer science subfield of Natural Language Generation; the tinkering with im- 
plementations that brought n-gram text generation to microcomputers; and story 
generation projects emerging from within the frameworks of the artificial intelligence 
community. Through these we see what looks like literary work happening on a 
variety of levels. 

First, at the level closest to traditional literature, there are generators which, with 


their processes and data, can produce many different outputs. 


Second, it is also possible, in many cases, to substitute different data while leaving 
a generator's processes mainly intact. This means that a set of processes can 


be used as a framework for producing many generators. In fact, sets of literary 
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processes have even been presented without any particular data, as literary 


tools or toys for which many can provide data. 


Third, and finally, underlying the processes of many generators (whether presented 
with or without data) there are common approaches, common algorithms, that 
seem to have been identified as particularly fruitful for literary work. These 
processes must be implemented (made to operate) in a way that allows them to 
fulfill the goals of the current project — and at an earlier point someone must 


have identified these processes as potentially fruitful for projects of this sort. 


Certainly it wouldn't be useful to refer to all of these different levels of undertaking, 
all these efforts of different types, as “works of literature.” It would collapse too many 
important distinctions. But it would also make little sense to ignore all but one of 
these levels of work, or ignore all of them — concluding that only, say, individual 
outputs from generators are worthy of our attention (that is, returning to the surface 
as our sole site of consideration). 

This chapter begins by looking at how a particular process, the n-gram text gen- 
eration introduced by Claude Shannon (as discussed in the previous chapter), came 
into wider, and eventually poetic, use. The history includes contributions of different 
sorts, made by several individuals. It is a history that clarifies not only the way that 
different contributions are layered, but also that a process definition is not sufficient 
for making a process usable. An implementation's efficiency (e.g., with memory, with 
disk access, with processing power) and the need, or lack thereof, for certain kinds 


of correctness (e.g., in statistical distribution) can be determining factors in whether 
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a process can be used for particular purposes — even when the underlying resources 
(e.g., hardware, programming languages, compilers) remain stable. 

We then take the first steps toward developing some vocabulary that may be 
useful to us as we seek to understand these different levels — discussing the interplay 
between abstract and implemented processes, and the operational logics that underlie 
literary processes. The concept of literary logics is then explored in more depth, using 
the example of the computer role-playing game (RPG) Fable. In particular, Fable's 
ambitious processes for its non-player characters (NPCs) are considered, and how 
they employ similar logics for controlling animation and speech. 

The chapter's next section returns to a deepening of our treatment of the his- 
tory of process-oriented literary work, by discussing projects of the Oulipo beyond 
Queneau's One Hundred Thousand Billion Poems — especially Oulipian transforma- 
tional procedures. Finally, with the insights gained from our examinations of this 
work, the closing pages of the chapter give a preliminary outline of a broader set of 
concepts and vocabulary that may help us think more precisely about processes and 
better understand the relationship between different works. The proposed terms in- 
clude: implemented processes, abstract processes, operational logics, works of process, 


literary works, and surfaces. 


Abstract, Implemented, and Literary N-Grams 


In the previous chapter (page 131) Claude Shannon's method of n-gram text gen- 


eration was presented, along with a number of examples. The last example was 
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generated by flipping to a random page within a book, choosing a random word, 
writing it down, flipping to another random page, reading until the just-recorded 
word was found, writing down the word following it on this new page, flipping to 
another random page, reading until the newly-recorded word was found, recording 


the word after it, and so on. Here is that last example, again: 


THE HEAD AND IN FRONTAL ATTACK ON AN ENGLISH WRITER 
THAT THE CHARACTER OF THIS POINT IS THEREFORE AN- 
OTHER METHOD FOR THE LETTERS THAT THE TIME OF WHO 
EVER TOLD THE PROBLEM FOR AN UNEXPECTED. 

Given the process being used, the surprising coherence of this text — especially 
toward the beginning — is of interest. It shows that simple random processes can 
begin to reveal and recombine intriguing features normally hidden within the data 
on which they work. It requires only paying attention to some aspect of the data 
(in this case, ts local word order) and using this to drive the process. This sort of 
“emergence” from the application of simple rules is of the surprising and satisfying 
sort that interests many digital media practitioners (as opposed to the poor play 
that emerged from the limited “look ahead” at future moves of Strachey's checkers 
program). 

Of course, this sample is still quite rough, in part because only one previous word 
is being taken into account at any time. What Shannon refers to in his article (1948) 
as “further approximations” — for example, taking two, three, or more previous 
words into account — certainly gives more English-like results. Shannon pointed out 
correctly that, using the method of flipping through a book, the labor involved in 


creating such further approximations would be enormous. But the modern availability 
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of computing power, together with the publication of some smart approaches to the 
problem, has made carrying out such processes automatically a near-trivial task for 
reasonably-sized bodies of sample text. 

In fact, the modern availability of computing power has made practical, and shown 
the power of, a whole area of work with language that previously was out of reach. 
Shannon's n-gram technique is a specific instance of the more general class of “Markov 
chains.”* And Markov chains, in turn, are a one of the more popular “statistical” 


approaches to working with natural languages. 


Clarifying processes 


The wider field of Natural Language Processing will be discussed later in this study. 
For now, we will turn our attention to how Shannon's technique for text generation 
came to be used in works of digital literature. We won't be doing this simply out of 
curiosity about n-grams. Rather, n-grams will provide a relatively simple example 
process — carried out on what is, by today's standards, quite minimal computing 


hardware — that will allow us to clarify a number of important facts about processes: 


e A computational process can be defined on an abstract level, independent of 


any use. 





lAs Shannon was well aware. (He wrote, just after the discussion of the sample messages 
reproduced in the previous chapter, “Stochastic processes of the type described above are known 
mathematically as discrete Markoff processes and have been extensively studied in the literature.”) 
Shannon, however, was the first to bring this mathematics to bear meaningfully on communication, 
and also the first to use it to perform text generation. Parenthetically, it is interesting to note that 
the first application of Markov models was also linguistic and literary — modeling letter sequences 
in Pushkin's poem “Eugene Onegin” — though this was presented from a mathematical, rather than 
communication-oriented, standpoint (Markov, 1913). 


146 


For use to be made of a process, it must be implemented. 


Processes may be implemented in any computational form: human computa- 


tion, analog computation, digital computation, etc. 


An implementation of the same abstract process, using the same type of com- 
putation, and using resources of the same (or similar) power, may employ the 


available resources more or less efficiently. 


The efficiency with which resources are employed can determine whether it is 
possible to carry out the abstract process at all, whether it is possible to carry 
it out to certain extents, and whether it is possible to carry it out at a speed 


sufficient for certain uses. 


An implementation of a process can be correct or incorrect in certain respects. 
A less correct implementation may cause the process to fail in certain circum- 
stances, or cause it to provide sub-optimal results in certain circumstances — 


but a fully correct implementation may pay a penalty in efficiency. 


In cultures centered on making (and writing, computer science, and computer 
game development are all such cultures), an abstract process comes to be seen 
as appropriate for a certain type of work (e.g., literary work) through demon- 
stration. This demonstration is an example of what has been achieved by an 
implemented version of the process. Such a successful demonstration shows 


that a process supports a certain operational logic. 
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e Low-level operational logics can be combined to support those at a higher level 
of abstraction. The same logic (or collection of logics) can also be used to deter- 
mine the behavior of multiple elements of a system. This can be very powerful, 
but must be approached with care. Á process that successfully supports a logic 


for one type of data may not do so with another type. 


e Once an abstract process has been established as an appropriate support for a 
certain type of logic, a number of further eforts may follow. Individual projects 
may be created that use implementations of this process toward this logic. 
Tools may be created, or detailed specifications of implementations provided 
(e.g., source code), in order to enable experimentation with the process and /or 


the creation of further projects built on the logic. 


Some of these facts, if viewed from a computer science standpoint, may seem obvi- 
ous. But they have received almost no attention within the field of digital literature, 
despite the fact that they create the conditions of possibility for such work. Others of 
these facts — even if they operate in computer science, literary, and computer game 


cultures — are little discussed in any of them. 


Bringing n-grams to micros 


For almost three decades after the publication of Shannon's paper, the n-gram tech- 
nique of text generation that he outlined lay fallow. While the n-gram model as a way 
of understanding text was somewhat further investigated, there seems to have been 


little interest in generating text with it, and it seems to have had no literary uses. In 
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part this may have been due to the effort involved in building texts by hand as Shan- 
non did, and in part due to computerized exploration being constrained by at least 
three factors. First, within the culture of science and engineering, Shannon's method 
of using the text as the model of its own statistics appears to have been too imprecise 
— not sufficiently “correct.” The early computer projects involving Markov-chain 
text operated on the assumption that statistically-accurate models would have to be 
used. Second, even for severely restricted versions of the method, computers that 
could handle the amount of data that would be generated using accurate statistical 
models (at least when employing the most obvious approaches to the problem) were 
unavailable for such work until the 1970s. Third, less-obvious approaches, capable 
of considerable increases in efficiency for the same basic technique, had not yet been 
published. Given all this, the chances of a literary use being found for a computational 
version of Shannon's generation technique were essentially nil. 

This all began to change in the 1980s. The home computer boom was putting 
increasingly powerful hardware (“microcomputers”) into the hands of hobbyists and 
others interested in creative applications of computational techniques. This saw the 
rise of programming manuals aimed at non-mathematicians (using relatively high- 
level languages such as Basic and Pascal), the inclusion of sample program listings in 
relatively wide circulation computer magazines such as Byte, and the publication of 
information about creative computing in non-specialist magazines (e.g., the “Com- 
puter Recreations” features in Scientific American). In this environment, Shannon 's 
text generation approach went from virtually unknown to quite widespread in only a 


few years. And those who aided its spread consistently demonstrated n-grams using 


149 


literary material. 

The major step in the move to micros was likely Brian Hayes's November 1983 
essay for Scientific American's “Computer Recreations.” His piece, subtitled “A 
progress report on the fine art of turning literature into drivel,” features n-gram 
versions of Joyce, Shakespeare, Faulkner, James, Virgil (in Latin), Dante (in Italian), 
and Flaubert (in French). In fact, the only non-literary examples are a completely 
random text and an n-gram text generated from the Basic program used to generate 
the literary examples (he reports that, once he worked up to a character-based 7- 
gram, “a substantial number of statements could be executed without getting an 
error message from the interpreter,” p. 28). 

Hayes's essay followed the work of William Ralph Bennett, who had written about 
n-gram text generation in the fourth chapter of his 1976 textbook Scientific and 
Engineering Problem-solving with the Computer and then popularized the first section 
of that chapter for a 1977 essay in American Scientist. Bennett's work contains n- 
gram texts generated from Shakespeare, Poe, Hemingway, and and Roger Bacon's 
13th century Latin. Hayes was inspired by Bennett's writings, by his literary take on 
demonstrating n-grams, and in particular by the challenge of finding a better method. 


Bennett wrote, in his article: 
It is quite impractical at the present to attempt to predict what really 
would come out of the [n-gram process] if we were to extend it to sixth 


order with high resolution. Fourth order is about the practical limit with 
the biggest computers readily available today... (p. 701) 


This moved Hayes to write: 
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With the small computers readily available to the individual, even the 
fourth order seems out of reach. 


“Practical limits,” however, are created to be crossed, and when the prob- 


lem is considered from another point of view, the prospects are not so 
bleak. (p. 26) 


Four implementations of a process 


Bennett and Hayes (and later Hugh Kenner and Joseph O'Rourke, writing together) 
each proposed different approaches to implementing letter-level n-gram text genera- 
tion. By taking the time to examine the specifics of these implementations, we can 
begin to understand better the complex relationship between two parts of what have 
so far been discussed simply as “processes.” Processes exist both as ideas and as 
operating examples of those ideas. To put it another way, there are both abstract 
specifications of how a process operates and concrete implementations of processes. 
And while this may sounds like a neo-Platonic notion of ideal form and corrupt re- 
alizations, the history of n-gram text generation reveals a rather more interesting 


relationship. 


Bennett'”s algorithm 


Bennett's approach to n-gram text generation (roughly as outlined by Hayes) is the 
following. First, the number of possible characters is reduced to 26 upper-case letters 
(no lower case), the word space character, and the apostrophe (no other punctuation). 
The result is a 28 character set to be used for representing all the text involved. To 


generate 1-gram text (in which letter frequencies reflect those in the data) an array 
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of 28 elements is created.? Each element contains a number, initially zero. The data 
text is then read by the system, and each time a character is encountered the number 
in the corresponding element of the array is increased by one. Once the text has 
been read, the resulting array will have relatively high numbers in some elements 
(e.g., those for “E” and the word space) and relatively low numbers, perhaps zeros, 
in some other elements (e.g., those for “J” and “Z”). To generate the first letter of a 
l-gram text, a random number is generated between one and the number of characters 
in the data text. The number in the first array element (representing “A”) is then 
subtracted from the random number, and if the result is zero or less an “A” is added 
to the text being generated. If the number is still positive, the number in the next 
array element (representing “B”) should be subtracted — and so on until a letter is 
printed. Then the next random number is generated, and the next letter is printed, 
until the desired length of text is reached. 

To generate a 2-gram text, the procedure is somewhat different. Rather than a 
one-dimensional array of 28 elements, the system begins with a two-dimensional array 
of the same length in each dimension (“28 rows and 28 columns,” for a total of 784 
elements). When the data text is being read, the system must keep track of both the 
most recent character and the one that preceded it (the second-to-last). Moving down 


the first dimension of the array, the system goes to the element that corresponds to 





2Arrays are a common data structure in computer programming. An array is often thought 
of as a row of empty slots, or “elements,” each of which expects to hold the same type of data. 
The data could be numbers, letters, or other complex data structures. An array, for example, can 
be structured so that each of its elements holds another array. An array of arrays is called a “two 
dimensional” array, and an array of these (an array of arrays of arrays) is called “three dimensional.” 
There are often more efficient ways to address a problem than through use of high-dimension arrays, 
as the example of n-gram text demonstrates. 


152 


the second-to-last character read. Then, taking the array in that element (a second 
dimension array), the system moves to the element of that array that corresponds 
with the most recently read character. The number in that element is then increased 
by one. When text is generated, for each new letter only the second dimension array 


corresponding to the letter most recently printed is considered. As Hayes puts it: 


For example, if the preceding letter is a B, only the elements of the second 
row are taken into consideration. The largest element of the second row is 
E, and so it is the likeliest letter; A, I, L, O, R, S and U also have a chance 
of being selected. Impossible combinations such as BF and BQ have zero 
frequency, and so they can never appear in the program's output. (p. 24) 


Bennett's approach to generating 3-grams and 4-grams, as one might guess, in- 
volves extending this approach to three dimensional and four dimensional arrays. 
The four dimensional array, even with the severely limited 28 character set, has more 
than 600,000 elements — pushing the edges of what could be stored in the memory of 
then-current machines. It was for this reason that Bennett's investigations stopped 
at 4-grams. That is to say, Bennett's implementation made n-gram text automati- 
cally computable, and in a way that reflected the real statistics of the data text — 
but while the abstract process of n-gram text generation is applicable to any n, the 
specifics of Bennett's implementation (a combination of his program and the com- 
puters available to run it) hit a wall at 4-grams, greatly restricting what potentials 


of the abstract process it was possible to explore. 
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Hayes's contribution 


Hayes, however, saw an alternative implementation, one which would make higher- 
order text generation possible on microcomputers. In fact, the Hayes approach “pro- 
vides a means of generating random text of arbitrarily high order with a character set 
that spans the full alphabet and includes as well any other symbols the computer is 
capable of displaying or printing” (p. 26). Unfortunately, the penalty is that it gen- 
erates texts about ten times more slowly than Bennett's, due to an excessive number 
of “reads” of the data text. 

Hayes creates a one-dimensional array with as many elements as there are different 
characters in the data text (i.e., elements for all the types of upper and lower case 
letters, punctuation marks, and the word space). Then, during text generation, the 
previous n-1 characters added to the text (so, for a 4-gram generation, the previous 
three characters) are considered as a pattern. Then the entire data text is searched 
for instances of the pattern. Each time the pattern is found, the following character 
from the text is extracted, and the number in the element of the array corresponding 
to that character is increased by one. After this search for the pattern is complete, 
the next letter added the the generated text is chosen by the same method as before: 
successive subtraction of the numbers in the array from a random number. Then 
(to generate another character in the text) the elements in the array are returned 
to zero, the pattern is updated by removing the oldest character and including the 
one just added to the generated text, and the data text is searched for all instances 
of the pattern. This repeats until the desired length of text is acheived. With this 


implementation Hayes made the process of n-gram text generation available to home 
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computer users, and made movement to higher chain lengths possible for those in 
university settings. 

However, this implementation of the abstract n-gram process was remarkably slow 
— too slow to allow for the kinds of experimentation that might interest many people 
oriented toward the literary possibilities of n-gram text. This changed a year later, 
with the November 1984 publication of “A Travesty Generator for Micros” in Byte. 
This article?s authors, Hugh Kenner and Joseph O*Rourke, not only presented two 
methods that were faster than Hayes's — they also published a complete listing of 


the source code for their system, Travesty, in the programming language Pascal. 


Travesty and Hellbat 


Just as Hayes had recognized a weakness in the central part of Bennett's implemen- 
tation (the creation of a crushingly large number of array elements, most of them 
empty), so Kenner and O'Rourke recognized a weakness at the core of Hayes's ap- 
proach: the entire text must be read through each time a letter is added. This is the 
primary reason Hayes's implementation is so slow, and most of the time is wasted on 
checking patterns that don't even start with the right letter. Travesty's solution is 
to create an array the same length as the data text, which the authors call SkipAr- 
ray. Each element of SkipArray holds a number, and that number identifies the next 
element in the array that corresponds to the same letter. So, for example, if there is 
a “p” as the seventh character in the data text, and the next “p” is the eighteenth 


character in the data text, in SkipArray the number in the seventh element will be 


eighteen (and the number in the eighteenth element will point to the element cor- 
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responding to the next “p”). When searching for matches to the n-gram pattern, 
Travesty looks at the first character of the pattern, and then consults a small array 
called StartSkip — this array, indexed by characters, contains the first location of 
each character in SkipArray. The data text itself is read into an array called BigAr- 
ray. As each location of the pattern's starting character is read out of SkipArray, the 
appropriate location in BigArray is checked to see if the rest of the pattern is present. 
Because Travesty doesn't have to read through the entire text, but can skip to the 
locations where the first letter of the pattern is present, the overall system is about 
seven times faster than one implemented as Hayes suggests. As Kenner and O'Rourke 
put it: “Scanning a small input file on a fast system, we've seen new characters pat- 
ter onto the screen faster than we could read them. The analogy is no longer with 
molasses but with raindrops” (p. 466). They report that long text generation tasks, 
which once took half an hour, are with Travesty accomplished in five minutes. 

But an even faster option was available. Remember that Shannon, who first 
suggested the n-gram approach to text generation, didn't build up any representation 
of which patterns were most common — didn't create what in Natural Language 
Processing is called a “statistical language model” — he let the text be its own 
model. Kenner and O'Rourke created a version of Travesty that worked this way, 
just jumping into BigArray at a random point and searching linearly until a match 
for the pattern was found (wrapping around to the beginning if no match is found 
between the entry point and the end). This increased the system's speed by better 
than three times, taking the output from raindrops to a torrent, and earned the 


new version the name Hellbat. But they didn't dispose of the SkipArray version of 
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Travesty in favor of Hellbat. They explain why with this example: 


Let's imagine an input that contains, midway, the sequence “... silk stock- 
ings and silk hats...”, and no other occurrences of “silk”. Suppose the 
pattern we are matching is “silk_”. The chances are good that a random 
pounce will land either before that sequence or after it. If before, the 
“silk” will be followed by “stockings”. If after, then wrap-around will 
carry us around to before, and “silk” will again be followed by “stock- 
ings”. Only if the jump chances to land in the short interval between the 
two occurrences of “silk” can the following word ever be “hats”. So the 
output can settle into a tedious loop, “silk stockings and silk stockings 
and silk stockings and ...” 
Of course, when real statistics are used — as in Travesty, rather than Hellbat — 


the chances of “silk stockings” and “silk hats” are equal. 


Processes, abstract and implemented 


Following the publication of Kenner and O'Rourke's article, Byte played host to a 
variety of follow-up comments from readers, each suggesting an alternative, often 
more efficient, scheme. As a result of this and other developments, by the mid-1980s 
n-gram text generation was firmly in circulation among microcomputer users. A 
related approach (sometimes called “Dissociated Press” after the version built into 
the Emacs text editor) reads from the data file for several continuous characters 
before searching for a pattern match, at which point it jumps to another point in the 
text (where the pattern appears) and reads for several continuous characters before 
searching for another pattern match.? Also, n-gram text generation can be carried 


out at the level of words, rather than characters, using the same basic techniques. 





3 As the Emacs manual puts it, “This makes for more plausible sounding results, and runs faster.” 
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Words are “tokenized” so that they can be treated as individual units, the way letters 
are by Travesty. This was the basis of a textual toy for MS-DOS, Babble! — and is 
available to users of modern operating systems in Brion Moss's prate program. 

The point of the story of Bennett, Hayes, Kenner and O'”Rourke, and further 
developments is not, however, mainly to give a history of the coming of n-gram text 
to microcomputers. Rather, as suggested earlier, it is to point out that we should 
further refine the way we have been discussing processes. It is true that the process 
for generating n-gram text was defined by Claude Shannon in 1948. But it might 
actually be more useful to think of Shannon's definition as having two parts. The 
first part was the definition of an idea: to generate a text, one looks at the preceding 
n letters (or words) and chooses the next based on the frequency with which different 
letters (or words) follow those. The second part of the definition was the specification 
of an implementation of that idea: flipping through a book to a random place, looking 
for the pattern until it is found, recording what follows, flipping to a new random 
place in the book, etc. 

The idea of a process, the abstract mode of operation of a process, has a complex 
relationship with a process's implementations. As we have seen with the example 
of n-gram text generation, different implementations can be vastly different in their 
use of computing resources. Differences of this sort can make types of work possible 
or impossible. This is not simply because one may run out of memory (as with 
Bennett's implementation) but also because, for example, the system may simply be 
too slow for certain desired uses (as with Hayes's implementation). This is similar 


to Lev Manovich's observation, quoted in the previous chapter (page 114), about the 
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relationship between computer rendering and the experience of interactive navigation. 
When the computer can carry out an operation fast enough, even one that it has 
been able to carry out (more slowly) for many years, a change in quantity can enable 
qualitatively new experiences. 

In addition, the different implementations of a process (which embody different 
choices about how the elements of the abstract process are carried out concretely) 
can lead to differences that aren't best understood in terms of speed (or efficiency 
in general). For example, Shannon's implementation and Hellbat are both suscepti- 
ble to the type of difficulty highlighted by Kenner and O"Rourke's example of “silk 
stockings and silk hats” — while Travesty and similar implementations are not. One 
is statistically “correct,” while the other uses the text as a model of its own statistics. 
Both approaches can be understood as implementations of the same abstract process 
for text generation, but nevertheless certain users (employing certain data) might 


regard Travesty's results as interesting but Hellbat's as unacceptably repetitious. 


Literary n-grams 


Those who brought n-grams to microcomputers couched their discussions in terms of 
literature. The demonstrations printed with their articles were largely derived from 
literary data texts. They talked about how the “texture” of the approach to language 
of different literary authors could be discerned even using patterns of only three or 


four characters.* 





*This seems to have begun with Bennett, who formulated statistical text generation in terms of 
the famous image of the great works of literature being produced if “enough monkeys were allowed 
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There is also overlap in the literary authors whose text is used in generating n- 
gram demonstrations for the articles. For example, Kenner and O"Rourke, like Hayes, 
include a sample of n-gram Joyce. Kenner and O'Rourke's is a 4-gram text generated 
from a 75-word passage of Ulysses. The sample begins: “Gleaming harnesses, petti- 
coats on slim ass rain. Had to adore. Gleaming silks, spicy fruits, pettled. Perfume 
all” (p. 460). 

Unlike Hayes, Kenner and O'Rourke also show an interest in poetry”s line breaks, 
which they preserve from the initial text by recording the “|” character at the point 


of each break. When this character is produced during text generation, a line break is 


printed instead. Their sample of text generated from Eliot's The Hollow Men begins: 


We are not appear 
Sight kingdom 
These do not appear: 


This very long 
Between the wind behaving alone 


And the Kingdom 
Remember us, if at all, not 
appear 
Sightless, unless 
In a fading as lost kingdom 


It may be that this text and the other examples printed with influential articles 


about n-gram generation are of literary interest. Just as Shannon's choice of data 





to pound away at typewriters for enough time” (p. 694). He noted, “The fourth-order Shakespearean 
monkeys have a poetic flavor reminiscent of the Bard himself, whereas the fourth-order Poe monkeys 
make various involuted remarks about skulls and gools and showed an unusual interest in the kind of 
long and complicated words that Poe himself probably would have liked (e.g. CONCLUDELINESS, 
COMPANTABLIGHT, and STRABLEMEMBER)” (p. 700). 


160 


produced a “frontal attack on an English writer” that few other books would have 
made possible, so Kenner and O'Rourke's choice of data gives us “Gleaming harnesses, 
petticoats on slim ass rain” that one feels few but Joyce might have produced through 
the process of letter-level 4-grams. Thinking through the implications of this may 
provide some literary insight. 

But these examples are not presented as works of literature, they are intended as 
demonstrations, as illustrations of the fact that n-gram text can capture something of 
the literary in language. It remained for others to use Travesty, or similar processes, 
to create works of literature. Charles O. Hartman, the poet and digital literature 
theorist mentioned at the opening of the previous chapter's section outlining three 


chance processes, is one who took this step. 


Monologues of Soul and Body 


In his 1996 book Virtual Muse, Hartman discusses one of the works he has created 
with Travesty, “Monologues of Soul and Body.” The impetus of this project, for 
Hartman, came when he was experimenting with the code for Travesty. He began 
to experience a somewhat different version of a common reaction to combinatory 
processes: humor and awe. 

In the common version of combinatorially-induced humor and awe, the humor 
is at specifics (at the ridiculous combinations that can result) while the awe is at 
the vast possibilities (most commonly expressed, in critical discourse, by the perhaps 
over-used phrase “combinatorial explosion”). While Hartman found a similar source 


of humor, his source of awe was somewhat different, and more specific to the n-gram 
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process. In describing his reaction, he begins by pointing to the “childish pleasures” 


promised in Travesty's name: 


It's the wickedness of exploding revered literary scripture into babble. We 
can reduce Dr. Johnson to inarticulate imbecility, make Shakespeare talk 
very thickly through his hat, or exhibit Francis Bacon laying waste his 
own edifice of logic amid the pratfalls of n = 9. 


Yet the other side of the same coin is a kind of awe. Here is language 
creating itself out of nothing, out of mere statistical noise. As we raise n, 
we can watch sense evolve and meaning stagger up onto its own miraculous 


feet. (p. 57) 


This combinatoric awe is not at vastness, but at order, and at simplicity. It is 
an awe at constrained combinatorics that take huge numbers of inchoate possibilities 
and narrow them to a few comparatively coherent possibilities simply through the 
application of relatively uncomplicated rules. Or, put another way, it is awe of the 
simple abstract process of the n-gram which, rotely applied, can produce the utterly 
senseless (beyond our unassisted powers of nonsense, and yet somehow recognizably 
following the patterns of our language) and then build up toward something like 
sense. This simplicity is perhaps an inversion of that of the Turing machine — a 
handful of simple procedures upon which great, coherent complexity can exist. And 
it happened that, at the time of Hartman's experiments with Travesty, he was also 
working on a poem that took Alan Turing as a subject, and specifically his famous 
“imitation game” (an important formulation of artificial intelligence, which will be 
discussed further in this study's last chapter). Turing had his place in the poem 
because of its “interweaving obsessions: chess, computers, war” (p. 58). 


Hartman took the poem he had written and ran it through his version of Trav- 
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esty? at eight different chain lengths: n=2 through n=9. The results were certainly 
evocative of his themes, as computerized processes seemed to build sense that wove 
through text about human and computerized sense making, paired with real and sym- 
bolic war. And perhaps Hartman could have produced a piece that consisted only 
of this: a version of Travesty, bundled with his source texts, and with instructions 
to run the program eight times at progressively longer chain lengths. The instruc- 
tions would have provided the larger structure of progression that was the foundation 
of Hartman's interest in employing n-gram text in the poem (but which the simple 


n-gram operation, on its own, cannot provide). 


Composing n-gram text 


However, rather than attempt to defer structure to the audience's time of reading 
(and instruction-following), Hartman chose to use a more traditional tool of poetry: 
the printed page. Here he performed the dialogue between his traditionally-authored 
text and the Travesty-generated text as an alternation. Of course, “dialogue” is 
probably the wrong term for this relationship, as Hartman's title points to the idea 
that soul and body are speaking individually, rather than conversing with each other, 
and in the conception of the poem Hartman's original text plays the role of “soul” 
while the n-gram text is its “body.” This makes the body a product of rearranged 
soul, an intriguing and strange image. As Hartman puts it, “In the computer output 


I saw the body constructing itself out of the material of soul, working step by step 





Which he had customized to keep producing output until a period or question mark coincided 
with the end of a line. 
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back to articulation and coherence. It's a very Idealist poem, and at the same time 
very Cartesian, and perhaps monstrous” (p. 62). Or, as C. T. Funkhouser puts it 
in Prehistoric Digital Poetry: An Archaeology of Forms 1959-1995 (forthcoming), 
“As the poem progresses, and the “body” text is less abstract, the author succeeds in 
creating parallel monologues in which one (“body”) borrows from the other.” 

In addition to taking advantage of the printed page as a venue for higher-level 
structure (and context) lacking in straight n-gram text generation, Hartman also takes 
advantage of the page to make selections. Rather than simply use the first “body” 
texts produced by his version of Travesty, Hartman spent “several long days” (p. 64) 
reading different outputs before making his selections. The text of “Monologues of 
Soul and Body” bears signs of this, especially, again, at the level of structure. For 
example, the first sentence of Hartman's “N = 5” stanza is “Possible word.” — while 
the first sentence of his “N = 6” stanza is “Possible world.” The appearance of these 
two similar short sentences somewhere within the “body” of output might seem, to 
the reader, like happenstance. Their prominent structural placement, however, even 
if it were chance, would not read that way. Instead, it feels like a reassurance that 
the author's hand is present even in these “randomly generated” texts, at least at the 
level of selection and placement. 

That is to say, in the end, Hartman produced a relatively traditional poem, even 
if it incorporated a form of text rarely seen. Certainly there's nothing wrong with 
this. What we think of as “traditional poetry” is actually the outcome of a series of 


adaptations of once-unfamiliar textual forms. As Hartman puts it: 


A new form begins as someone's invention and then, maybe, proves useful 
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to other poets. We know one Greek form as the “Sapphic stanza.” It 
may have been worked out originally by her contemporary, Alcaeus, but 
it was a favorite of Sappho's, and she used it in writing some magnificent 
poems. We still give it her name, though for thousands of years poets 
from Catullus through James Merrill have been using it — and varying 
it. The differences between Greek and a language like English reguire 
some variations.... These variations in a form can work like mutations in 
biology, providing the material for evolution. Ezra Pound claimed that 
the sonnet began as someone's variation on the older canzone. (p. 101) 


And this observation, in turn, leads Hartman to a question: 


Will certain computer poetry methods catch on and establish themselves 
and evolve in similar ways? Or will we shift the demand for originality in 
poems toward meta-originality in method? Another decade or two should 
tell us. 


It's been a decade since the publication of Hartman's book, and we can give a 
preliminary answer to his question: both. Experiments with n-gram text, for example, 
continue — and continue to evolve — while the development of “new” methods (many 
with pedigrees as long, or longer, than that of Markov chains) also continues. 

But there is another question that follows from Hartman's: How do we think 
of these different forms of creativity? Consider the layers of creativity, by different 
individuals, that stand behind Hartman's “Monologues.” There is Markov's original 
formulation; Shannon's use (three decades later) of this abstract process for text gen- 
eration; Bennett, Hayes, Kenner, and O'Rourke demonstrating (more decades later) 
that the linguistic operation identified by Shannon could create intriguing results 
with literary texts; Hartman's decision to take that literary operation and employ 
it in his own work; and the innovations in implementation of Bennett, Hayes, Ken- 


ner and O'Rourke, and Hartman himself. While we could refer to all of these as 
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“meta-creativity,” the situation seems to call for a more fine-grained set of concepts 
and vocabulary. This chapter's next section will make some preliminary suggestions 
along these lines. 

Before moving on, however, it is in addition worth noting that n-grams are also 
employed in works for which audience interaction is primary. John Cayley?s work in 
this area, particularly with the word-level digrams that he calls “collocations,” is an 


early and influential example. 


Operational Logics 


Spacewar!, created on the PDP-1 computer at MIT in the early years of the 1960s, 
was the first modern video game (figure 4.1). Two players each had a custom-made 
controller, which they used to steer the flight of a virtual spacecraft on the PDP-1's 
CRT. The spacecraft were both pulled toward the star at the center of the screen by 
simulated gravity, and could fire projectiles at one another. A spacecraft hit by the 
central star or a projectile would be damaged. 

Many video games still work with this same basic vocabulary today (e.g., naviga- 
tion through a space with simulated physics, firing projectiles, etc.). This vocabulary 
is built on a set of basic logics, or operational elements.* One of these logics, familiar 


to everyone, is gravity. A logic that is less well known is “collision detection.” Grav- 





ST first discussed this notion of operational logics in an essay titled “Playable Media and Textual 
Instruments” (2005). Since then my thinking has been influenced by reading lan Bogost's Unit 
Operations (2006). 
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Figure 4.1: Playing Spacewar! 


ity's logic is that things fall toward gravity's source.” In most games this is the bottom 
of the screen, though for Spacewar! it was the central star. Collision detection's logic 
is that objects don't just pass through each other. This supports navigation — for 
example, that players can't walk through walls. It also supports firing projectiles 
— when a projectile hits something, that's collision detection. Gravity and collision 
detection work together in games that involve walking and running on surfaces at 
different “heights” in the game world, especially those that make jumping between 
these surfaces a focus of gameplay (such games are sometimes called “platformers” ). 

While logics such as gravity and collision detection are quite consistent across 
many games, there is variation in the abstract processes used to make them opera- 
tional. For example, an abstract process for collision detection would be different for 
a game in two-dimensional space as opposed to three-dimensional space. And there 
is quite wide variation in the specific implementations of the abstract processes for 


supporting these logics. The implementations differ not only along obvious technical 





"In most computer games, in any case. Obviously, the model of gravity in more realistic physical 
simulations involves attraction between all masses. 
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directions (implementations for different game platforms) but also by design. The 
model of gravity, its specific behavior, is different for a game like Super Mario Broth- 
ers than for Spacewar! It will be implemented differently, even if it follows the same 
abstract process for modeling gravity. 

It is also worth noting that the game industry divides up the work of creating a 
game in different ways, which reflect these structures. A set of implemented logics 
that define most of the operational vocabulary of a game can be bundled together 
as a “game engine.” Alternately, an implemented set of logics of a particular sub- 
type can be marketed independently to game developers (e.g., software libraries for 
simulated physics). Game players recognize certain arrangements of implemented 
logics as defining game genres (e.g., “side scrollers” and “first-person shooters”). If 
the arrangements are too similar, we may see one game as a “re-skinning” or imitation 
of another (e.g., the many games similar to Tetris and Pac Man — or even the many 
variations bearing those two names). Of course, similarities along these lines (together 
with continuing characters and other data-defined “intellectual property”) are part 


of what makes game sequels marketable. 


Linguistic and literary logics 


Graphical logics are not the only logics. The first popular computer games, for exam- 
ple, were purely textual: the “interactive fictions” produced by Infocom and others 
(which were discussed briefly in an earlier chapter). These textual games used a differ- 
ent operational vocabulary from video games, supported by different logics, including 


arguably-literary logics that operate at relatively abstract levels (which others have 
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discussed in terms of the literary riddle (Montfort, 2003) and the performative quest 
(Aarseth, 2004)). These more abstract logics had to be supported by lower-level ones, 
including those for textual parsing and reply, world simulation, and progress tracking. 

Video games, too, often operate using logics beyond the purely graphical and 
at varying levels of abstraction. The high-level logics of video games can include 
the riddle or quest logics that appear in interactive fictions, but we also find high- 
level logics in some of the graphical vocabulary of video games. For example, as 
noted earlier, navigation through virtual space can be seen as a logic that operates 
at a relatively high level of abstraction, often supported by lower-level logics such as 
collision detection and gravity. 

In a different field, we can understand part of the work of the Natural Language 
Processing community as the identification and implementation of non-graphical log- 
ics — specifically, linguistic logics. Techniques such as sentence grammar expansion 
and prediction based on n-grams have been employed as abstract processes to sup- 
port particular linguistic logics. When these abstract processes are brought from the 
linguistic domain into literary work, it is because they are understood to potentially 
support a literary logic. Of course, not all such adoptions of logics into the liter- 
ary domain come from the direction of linguistics. This study?s discussions of story 
generation will show that its history includes a significant strand of work seeking to 
employ “planning” as a literary logic. 

From this set of observations we can add another concept, and provisional term, 
to our expanding view of processes. In addition to abstract processes (specifications 


of how a process operates) and implementations (concrete realizations of abstract 


169 


processes) we have operational logics. A logic is an element of a system, supported 
by an abstract process or lower-level logics, that is understood to embody an appro- 
priate behavior of that system toward a particular end. When I say “understood” it 
may sound as though I'm speaking of a purely theoretical move. But in communi- 
ties centered on practices of making — and writers, computer scientists, and game 
designers are all members of such communities — the required circumstances for es- 
tablishing such an understanding include demonstration. When Shannon showed that 
Markov chains can support a linguistic logic (though accomplishing this was more of a 
side-effect of his argument than a primary point) he did so by demonstration, provid- 
ing examples and specifying the implementation of the process that produced them. 
When Bennett, Hayes, Kenner, and O'Rourke established that this linguistic logic 
can serve as a literary logic, it was through a similar sort of demonstration. 

It is also possible, of course, to demonstrate that a process can support more than 
one logic. N-grams have been used to support linguistic logics, but have also been 
used in systems for music composition. In fact, n-gram processes appear to be able 
to operate in a way that is evocative of almost any serially-ordered phenomenon. 

Also, as discussed above, the behavior of an element of a system can be produced 
by the simultaneous workings of multiple logics. This simultaneity is relatively simple 
for operational logics such as the navigation of virtual space, but more complicated 
in terms of logics such as the quest. The next section will provide an example of 
this more complicated operation of simultaneous logics, and discuss the relationship 
of these operations with writing. Specifically, it will discuss the logics underlying the 


behavior of non-player characters (NPCs) in the 2004 computer game Fable. This 
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game's NPCs have both graphical and linguistic behavior which is produced by lower- 
level logics supporting a combination of high-level combat logics, story /quest logics, 


and world simulation logics. 


The logics of Fable's NPCs 


Fable was one of the most anticipated games of 2004. There are a number of reasons 
for this. The game designer (Peter Molyneux), the lead writer (James Leach), and 
a number of other team members had worked together on previous projects (e.g., 
Black and White, Dungeon Keeper) that had been seen as innovative and enjoyable. 
Second, the concept of the game caught the imagination of players and the game 
press: combining a traditional computer role-playing game (RPG) with a more richly 
simulated world. Third, Molyneux was immensely successful at generating interest in 
the game through his public speaking and previews with the game press. A particu- 
larly widely-cited example is his speech at the 2002 Game Developer's Conference — 
when Fable still went by the development name Project: Ego — in which Molyneux 
stated: “I reckon that Project: Ego is going to be the greatest role-playing game of 
all time, which is insane. I could say the second greatest, I could say quite good, I 
could say, “Hmmm, it's quite nice,” but I'm going to say greatest [RPG] of all time” 
(XBN, 2004). 

The promise of Fable's concept was mainly couched in terms of the player character 
(Conrad, 2001; Goldstein, 2002). This character, controlled by game players, would 
begin as a young boy and grow to be the greatest hero in the world of Albion, and 


from there grow into old age. In the course of play, the character?s decisions would do 
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more than impact the current score. Being outside would give a tan and hasten skin 
aging, using magic would eventually cause eyes to smoke and hair to recede, running 
would cause larger and more powerful leg muscles, combat wounds would produce 
scars, and so on. In addition, the game would interpret the player character?s actions 
from a moral perspective, and this would be seen in the simulation: the character's 
appearance would change as a reflection of their good or evil decisions, and ordinary 
citizens would greet the player character with joy, or run in terror, or react with 
disdain, or perhaps not yet even know who he is. The player character would be able 
to engage in marriage, plant acorns that would grow into trees, compete with other 
heros, and — in the end — save the world. 

Reaction to Fable, after its release, was quite mixed. It was positively reviewed 
in the game press, and regarded as an enjoyable RPG (if somewhat short in hours 
of gameplay) or, perhaps, a strong entry in the more linear form of the “adventure 
game.” On the other hand, Fable was widely debated by players. Some derided it for 
not living up to its hype — many of the promised features were not delivered, and 
those present weren't quite as imagined or described. Other players defended Fable 
staunchly as a great game, even if a different one than that promised in the days of 
Project: Ego. 

Here we'll examine one particular facet of Fable's ambition, accomplishment, and 
disappointment: the behavior of its non-player characters. Depending on the circum- 
stances, one of Fable's NPCs can be controlled by combat logics, story /quest logics, 
or world simulation logics — operating at both graphical and linguistic levels. For 


example, a quest in Fable might require getting past a point guarded by a gatekeeper. 
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When under control of the story logic, the gatekeeper presents scripted dialogue and 
animations that represent what the keeper wants the character to do (e.g., present 
the keeper some item, which it may require a particular quest to acquire). If the 
player character's actions remain within what is anticipated in the story script, this 
logic remains in control. But if the player character does something unanticipated 
from among the actions made available at the player's interface — e.g., threatening, 
farting, changing clothes into something ridiculous — the gatekeeper”s graphical and 
linguistic responses are determined by the logics of the world simulation. And if 
the player character initiates a fight with the gatekeeper, perhaps to force his way 
through the gate, the keeper”s actions are determined by combat logics. 

This mixing and multiple use of logics — including logics driven by processes of 
random selection much like that in Tzara's newspaper poem — has both anticipated 


and strikingly unanticipated consequences, especially at the level of language. 


Story logics and language 


As mentioned earlier, James Leach led the team of writers working on Fable. This 
work included shaping the overall story, providing language and animation cues for 
the non-player character interactions driven by story logic, and crafting many atomic 
bits of speech that could be uttered by NPCs when controlled by the world simulation 
logic. These types of writing were integrated, via interactions with Fable's system 


programmers, into the logics controlling the experience at different levels.? This 





8The following information is garnered from personal communication with Leach and experience 
playing Fable. 
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section will present a short example of the way that language was made to fit into 
Fable's story logics. 

Early in the game's development, the writing of Fable began with a short story. 
The story set up one prototypical version of events, broken into scenes. These scenes 
were then broken down further, with the dialogue pulled out and bullets for any 
important physical actions (those for which, in the eventual game, special animation 
would be required). There was not yet — in this version — any operational logic that 
could produce multiple experiences, but rather one prototypical linear path through a 


story and its world. A sample from the “arena entry” scene originally read as follows: 


e A GUARD bangs on the door with his fist. 


GUARD ONE 
Open up! There's a new sandmeat delivery. 


e The gate opens, you both walk in. 


e You are escorted by two guards. 


GUARD TWO 
Oh look. A hero. Tm so excited. 


GUARD ONE 
Welcome to the Arena. Where the Heroes fight. And die. 


GUARD TWO 
Have you been nominated? Good. A couple of things you need to know. 
One, don't attack the heroes fighting alongside you. Two, only kills earn 
you money. Any bleeding heart compassion and you walk out broke. 


GUARD ONE 
Yeah. Like in a hundred pieces. 


e He opens gate at the end of Hall of Champions. 
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GUARD TWO 
And this will be your palace for the duration of your stay. 


e Guard opens cell door and you step in. 


GUARD ONE 
Good luck, sandmeat. 


This text was, essentially, in script form. It was then passed to Fable's system 
designers and animators. The goal of the system designers was to take the proto- 
typical story represented by the script and convert it into a system that, following 
traditional adventure or quest conventions, could produce many possible subsets and 
sequences of interactions that would successfully tell the story. In other words, the 
system designers implemented a set of processes that could support the story logic. 

In Fable the high-level logic of the story is supported by a logic of quests. Some 
quests are optional, while others are required to move forward in the story of the 
development of the player character and the world. Quests, in turn, are composed 
of actions — some of which are required to complete the quest, some of which are 
optional, and some of which can be carried out to varying degrees. These actions 
are situated in Fable's world, which also presents opportunities for actions that are 
not tied to particular quests, but some of which may be helpful in later quests. For 
example, players can take non-quest actions to acquire money or hidden “keys,” both 
of which can help in the acquisition of more powerful equipment, which can help in the 
completion of quests. Finally, quest actions can require interaction with non-player 
characters who have roles in the story. From the point of view of the quest logic, 


the most important function for these non-player characters is as part of the quest 
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machinery — providing information about what is required on the quest, providing 
objects or access to spaces required to complete the quest (often only after the player 
has completed other actions required before passing that point), or themselves being 
objects of the quest (e.g., when a quest requires defending or defeating particular 
non-player characters). 

In the process of turning the original Fable short story into a system driven by 
these logics, the system designers also replaced the original dialogue written by Leach 
and his team with placeholder dialogue focused on the roles of characters within 
these logics. From this point of view, the main function of the arena guards is 
twofold. First, to explain to characters that they are not able to pass into the arena 
portion of the story until they have completed the quest that results in them being 
nominated. Second, to transfer players into the arena holding area if they decide 
to enter after having been nominated. These functions are supported by a logic of 
simple conditional interactions between the player character and the guards, with 
voice actors recording the portions performed by the guards. The system designers 


passed this portion back to the writing team in the following form: 


TEXT QST 005 GUARD ENTRY CHECK_10 
GUARD 
Welcome to Arena. Heroes fight here etc.. 


TEXT_QST_005_GUARD._ENTRY_CHECK_20 
GUARD 
You need to be nominated. And Guild Seal won't work. 


TEXT_QST.005_GUARD.ENTRY_CHECK_FAIL 
GUARD 
Not nominated yet! 


TEXT _QST 005 GUARD ENTRY CHECK PASS 10 
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GUARD 
Been nominated now. Well done. 


TEXT_QST_005GUARD._ENTRY_CHECK_PAS5_20 
GUARD 
Guard will show you in. 


TEXT_QST 005 GUARD._ ENTRY _CHECK_PASS_30 
GUARD 
You could die in here. 


TEXT_QST.005GUARD._ENTRY_READY 
GUARD 
Go in? Yes/No. 


The job of the writers, at this point, was to weave back into these lines (which were 
tied into the story, quest, action, and conditional interaction logics) some semblance 
of the texture of language from the original story. The writers had to as much 
as possible work within the structure created by the system designers: only using 
lines with existing numbers and tags, leaving questions and responses in the places 
indicated, and so on. As Leach wrote in personal correspondence, “New lines can be 
added to explain something further or enrich the atmosphere, but as a default we try 
to work within the lines we're given.” 

In this structure the lines that are sequential end with a multiple of ten (10, 
20, etc). When lines needed to be added, a technique was used that is familiar to 
those who have used older programming languages that require line numbers (e.g., 
Basic): simply adding up to nine lines between numbered lines. Also, multiple lines 
for the same circumstance were numbered in this latter way. These multiple lines are 
selected between, when Fable is played, by random choice — presenting a “sack of 


lines” much like Tzara's “sack of words.” This random selection between lines was 
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used to try to keep story events that can happen multiple times from triggering the 


same line each time, repetitiously. For example: 


TEXT _ SHOPKEEPER_ROBBED- 01 
SHOPKEEPER (ANGRY) 
Stop! Thief! 


TEXT_SHOPKEEPER-ROBBED 02 
SHOPKEEPER (ANGRY) 
Hey! You stole that! 


TEXT _SHOPKEEPER_ROBBED 03 
SHOPKEEPER (ANGRY) 
Come back with that! That's my stock! 


In these situations the writers not only worked to create language that functioned 
well within the structures, but also to indicate the types of general-purpose animations 
that NPCs should use when delivering their lines. These were specified by emotion 
or attitude tags, which were placed next to the narrator's name. Common tags 
indicate emotions such as fear, sadness, happiness, pride, annoyance, and anger. 
(The shopkeeper is angry when speaking all of the lines above.) These determine 
body language and other choices made by the animation engine, and in Fable they 
slowly decay over about ten seconds unless refreshed or replaced by another tag. In 
other words, the structure of behavior determined by the story logic simultaneously 
triggers linguistic and graphical behavior on the part of NPCs as selected by the 
writers (and as produced by voice actors and animators). 

This, then, is the revised dialogue for the arena entry scene, rewritten by the 


writers and with emotion tags added: 


TEXT _QST 005 GUARD ENTRY CHECK_10 
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GUARD (ATTITUDE_PRIDE) 

Welcome to the Arena. This is where Heroes battle to become legends. 
TEXT_QST_005GUARD._ENTRY_CHECK_20 

GUARD 

You need to be nominated before you are allowed entry. And note that 
your Guild Seal won't work in here. 
TEXT_QST_005GUARD._ENTRY_CHECK_FAIL 

GUARD (ATTITUDE._ANNOYANCE) 

You can't come in without a nomination card. You think we just let 
anyone in? 

TEXT_QST_005GUARD._ENTRY_CHECK_PASS_10 

GUARD 

Ah, a nomination card. You must have done something pretty special to 
earn this. 

TEXT_QST.005GUARD. ENTRY _CHECK_PAS5S 20 

GUARD 

The Guard on the other side of the door will show you in. 
TEXT_QST.005GUARD. ENTRY _CHECK_PAS5_30 

GUARD (ATTITUDE SADNESS) 

Take a good breath before you enter. You might not see the outside world 
again. 

TEXT_QST_005 GUARD. ENTRY _READY 

GUARD 

You ready to go in now? 


As Leach puts it, “It's worth noting that the dialogue in the writer's version 
two of the script bears fairly little resemblance to the original scene.” While the 
original story shaped the game, the dictates of the game logics created the context 
for all the final language that is included in Fable. “This process was carried out, 
and then checked, and then voice acted, for the more than 200 speaking characters 
in Fable's story. In many ways this follows standard practices in game development 


today, though two important differences are worth mentioning. First, beginning game 
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development with an initial short story and script was a significant win for Fable's 
writers, as it is more common to bring writers in toward the end of the development 
process. Second, the same voice actors who recorded story-logic dialogue for a number 
of Fable's NPCs also recorded dialogue controlled by Fable's world simulation logics, 
making possible one of Fable's significant innovations: handing off the control of one 


NPC, using a consistent voice, between the story and world simulation logics. 


Simulation logics 


In Fable's world simulation, non-player characters” linguistic behavior is determined 
by logics that operate at two levels. One logic determines the NPC*s attitude toward 
the player character, which shifts in response to player actions. The other logics 
decide whether to speak and what to say based on four factors: the type of NPC, 
the current circumstance, the attitude toward the player character determined by the 
first level's logic, and a random choice. 

The first level's logic operates by keeping, for each NPC, a representation of what 
they know of the hero — measured in terms of alignment (morality) and renown 
(fame and power). An NPC might respond with fear to a hero with an evil alignment 
and high renown, but react with disdain and mockery to a hero that is equally evil 
but of low renown. A simplified way of representing this might look like figure 4.2, 
with attitude toward the player as a dot that moves as the NPC*s knowledge of the 
hero's alignment and renown shifts. 

The logics of the second level are somewhat more complicated. There are 17 differ- 


ent NPC groups, and Leach reports that they all have different voices and wordings. 
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fear awe  worship 


Renown respect 


disdain friendly 


none 


Figure 4.2: NPC attitudes toward Fable's player character 


NPC groups include males, females, guards, bandits, children, etc. NPCs don't just 
respond to the player character. They also make comments to one another (often 
about the player character, within earshot) and to themselves (just audibly). There 


are many circumstances that might elicit a response, including: 


Being greeted 

Being greeted again 
Being followed 
Wearing odd clothes 
Carrying odd objects 
Seeing a theft 

Being assaulted 
Seeing an assault 
Being given a gift 
Seeing a trespass 
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Being apologised to 

Being insulted 

Seeing murder 

Seeing decapitation 

Being courted 

Seeing vandalism 

Seeing kindness 

Receiving kindness 

Being cheated on (in marriage) 


In addition, simply catching sight of the player character can be enough to elicit a 
response from many non-player characters. The decision of whether this will happen 
appears to be determined by a chance process. Responses can also be triggered by 
ongoing states, such as being in a frightening circumstance. Given this, in crowded 
or dangerous circumstances NPCs will often have much to say. When what an NPC 
will say involves the player character, the choice of utterance will be made by cross- 
referencing with the current state of the attitude captured by the first logic. However, 
some particularly powerful NPC relationships (e.g., love) and particular player char- 
acter states can override this logic in choosing reponses. Here, for example, is one 


NPC group's set of possible responses to having their property trespassed upon: 


TEXT_AIl DEED_REACT_CRIME_TRESPASS_NONE 
You're trespassing. 


TEXT_ALDEED_REACT_CRIME_TRESPASS RESPECT 
What brings you here? 
TEXT_ALDEED_REACT_CRIME_TRESPASS_AWE 
Your Lordship? 
TEXT_ALDEED_REACT_CRIME_TRESPASS_DISDAIN 
You shouldn't be here! 


TEXT_AIDEED_REACT_CRIME_TRESPASS_FEAR 
Er, can I help you? 
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TEXT_AIDEED_REACT_CRIME_TRESPASS_ FRIENDLY 
Here, didn't you know this is a private place? 


TEXT_AIDEED_REACT_CRIME_TRESPASS_WORSHIP 
May I humbly ask what brings you here? 


TEXT_AIl DEED-REACT_CRIME_TRESPASS _RIDICULOUS 
What do you want, fool? 


TEXT_AIl DEED.-REACT CRIME_TRESPASS OFFENSIVE 
Hey. You're trespassing! 


TEXT_AIl DEED.-REACT CRIME_TRESPASS_AGREEABLE 
You! Hello there! 
TEXT_AI DEED._REACT_CRIME_TRESPASS_UGLY 
You aren't supposed to be here, you horror. 


TEXT_ALDEED_REACT_CRIME_TRESPASS_ATTRACTED 
Oh hel-lo there. 


TEXT_AI DEED._REACT_CRIME_TRESPASS.LOVE 
Wonderful to see you! 





Non-player characters can also have multiple things to say about the player char- 
acter in one circumstance, which appear to be selected between randomly. Together 
with the many reasons for speaking and the many possible knowledge states about 
the player, this created the need for 14,000 lines of NPC dialogue for the world. And 
within this huge number of lines were nearly-impossible writing tasks, such as cre- 
ating 200 different ways of saying hello that expressed different character types and 
degrees of awe. So here the team took advantage of the fact that they were using 


skilled voice actors. As Leach reports: 


In the end we actually explained to each voice actor how the tag indicates 
the attitude and let them ad lib whatever they wanted. This resulted 
in some extremely natural (not to say comic) lines, but we had to go 
through afterwards and transcribe each and every one for the localisation 
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teams. And there were 60,000 words to check, some of which were drunken 
mumbles or, frankly, just noises! 


Combining this great variety with the lines of dialogue controlled by the story log- 
ics results in a count, for Fable, well over 150,000 words of dialogue. For comparison, 
this represents more words than there are in this study. 

Different NPC types seem to have a comparatively smaller variety of animations 
available to them. When under control of the world simulation they walk, stand, or 
sit — sometimes walking together or standing in groups — and select, apparently 
through a partially-random process, from a small number of gestures and expressions 
(e.g., looking up at the sky while shielding their eyes, crossing and uncrossing their 
arms, scratching their heads, talking animatedly). The same NPC, however, can have 
multiple appearances. For example, village woman NPCs can have long hair worn 
down, a ponytail, or short hair; have brown, blonde, or red hair; wear a vest, bodice, 


or blouse; wear pants or a skirt; and so on. 


Playing with story and simulation logics 


We see, in Fable's world simulation logics, a different structure of graphical and 
linguistic content from that created by the story logics. In the interactions controlled 
by story logics there are scene-specific animations, scene-specific lines of dialogue, and 
generic animations (presumably, for example, a number of lines in different scenes 
might be spoken with the animation for “attitude_sadness”). The world simulation 
logics, on the other hand, require both generic animations and generic dialogue. The 


same animations and lines of dialogue might be used multiple times, by multiple 
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characters (of the same NPC type), in multiple scenes. In each case, however, the 
operations of the current logics trigger NPC animation and speech in the same way. 
In some cases this is fine, but in others it is problematic. A simple description of 
the processes does not make this apparent, but even a few minutes of playing Fable 
reveals it. 

Fable begins in the player character's home town, when he is a boy, before he is 
set on the path to becoming a hero. The player is immediately given a number of op- 
portunities to guide the character toward positive or negative actions (as categorized 
by the game) and set Fable's moral machinery in motion. Many of the non-player 
characters, and many of the possible actions, are clustered around a clearing with a 
large tree in the center (and surrounding buildings). This appears to play the role of 
town square. 

But not all possible actions are in the center of things. The player character 
begins at home, with his father, who tells him (and players) that he needs to buy 
his sister a birthday present, and that the father will give him a gold coin for each 
good deed. The interface then tells the player that interesting people for conversation 
will glow green around the edges. Then the game itself begins on the path outside 
the player character?s house, with a small girl walking down the path, a little ahead, 
glowing green as the player character nears her. What follows is an account of the 
beginning of one of my sessions playing Fable, intended to illustrate how the different 
logics drive NPC behavior during the interactive experience of gameplay. 

I begin my play session by directing the player character to speak with the girl 


— which doesn't cause him to say anything audible to players, but does cause her to 


185 


reply. She, under the control of story logics, explains that her toy bear is lost and 
asks for the player character's help in finding it. After she explains this, I direct the 
player character to speak with her again. She says, again under story logic control, 
“Please find Rosie, my little toy bear, for me!” As she says this, she goes through 
a speaking animation, the grass behind her sways in the wind, light shines on the 
chicken standing in the grass, music plays softly, and birds twitter. I move the player 
character away slightly, then return to speak with her again. She says exactly the 
same thing — plays exactly the same recording. This dominates the auditory field, 
and it sounds like I'm hearing the same thing again. At a linguistic level it is, in fact, 
exact repetition (even if, on a wider auditory level, the music and bird sounds are not 
exactly the same). However, because I have moved the player character slightly, the 
visual presentation of her speech is different. Rather than a chicken and some grass 
behind her, the scene is of decorated wooden steps and the side of a stone building. 1 
don't experience this as visual repetition. In other words, her behavior is repetitive, 
but this repetition is experienced differently in its auditory (and, especially, linguistic) 
context than it is in its visual context. 

As the player, P'd like to try to move her out of this repetitive one-line interaction 
driven by the story logics, but I don't have any appropriate options available to me. 
As a boy the player character lacks the expressions that will be available later (e.g., 
displaying trophies, flirting) and the player character's equipment doesn't yet include 
any other clothes he could change into in an attempt to extract a response. Running 
into her physically just pushes her out of the way. Following her just causes her to 


pause and look at the player character occasionally. Hitting her, because she is a 
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story character, actually returns us to the story logics — it's counted as the player 
characters first bad deed, and she cries, “Hey, don't be nasty!” Hitting her additional 
times results in the same sort of repetition as speaking with her (and doesn't appear 
to result in racking up any additional bad deeds). In other words, the exploration of 
Fable's alternating world and story control will have to wait until later, but we have 
already learned something about Fable's story logics. Characters under the control 
of story logics have only one thing to say to the player character at each point in 
the story state. If it is possible to interact with that character repeatedly without 
changing the story state, the result is repetition on both linguistic and animation 
levels. However, linguistic repetition dominates the auditory field much more than 
animation repetition dominates the visual field. 

Here Fable is displaying common behavior for games of this type. For example, 
2003's Star Wars: Knights of the Old Republic — which will be abbreviated “KotOR” 
— begins with a similar NPC interaction. The player character begins the game by 
talking with Trask Ulgo, another member of the crew of a space ship: the Endar 
Spire. The interaction takes the Trask NPC through several states in the story logics 
of KotOR. In any particular state (e.g., that of telling the player character to put 
on equipment) the same audio recording and character animation is played for each 
interaction. However, the visual field changes in two ways. First, as with Fable, 
the larger background against which the NPC speaks changes based on where the 
player character is standing. Second, KotOR doesnt have only one possible player 
character. Instead, the player, at the beginning of the game, selects the type of 


player character (soldier, scoundrel, or scout), the player character's gender, and the 
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player character's physical appearance (from a gallery of possible combinations of 
facial features, skin tones, and hair styles). Given these factors, the experience of 
repetition is like that of Fable. It is much greater on the linguistic level than on the 
visual level — from interaction to interaction, and from game to game. Nevertheless, 
on both a visual and linguistic level there is a mechanical feeling to the inhabitants of 
game worlds controlled only by logics such as Fable and KotOR's story logics. Fable's 
world simulation logics are an experiment that attempts to make the world and its 
characters feel more alive. 

Returning to my session of playing Fable, 1 next direct the player character around 
the building in front of which he spoke with the girl. Behind the building a couple are 
having a secret tryst. Listening in on their conversation, driven by the story logics, it 
is clear that the man is cheating on his wife. They don't notice the player character 
until I direct him to speak with the man, who glows green when the character faces 
him. The man offers a gold piece if the player character will stay silent about it, and 
I direct the character to accept the gold, which counts as another bad deed. Running 
back around the house, the little girl is still there, a little further along the path. 1 
direct the player character to speak with her, and now her response is different. She 
cowers a bit, and raises an arm as if to shield her face, as she says, “I can't find 
my bear ... don't be mean, just please help me find her.” Here we see one initial 
difference between Fable and KotOR. The player character in Fable can attack any 
NPC, can move the experience partially to the control of combat logics at any time, 
and the story logics include data that reflects this possibility — this is a significant 


addition to the authoring work for the story, but also a significant addition to the 
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player character's moral choices, which are central to Fable. On the other hand, 
KotOR (which similarly has the player character's choice between the dark and light 
sides of the force as one of its themes) allows combat only when system-defined enemy 
characters are present, and only allows those characters to be targeted. 

I direct the player character to walk down the path to the main area of town, 
by the tree. Here we see the first examples of NPCs speaking in a manner driven 
by world simulation logics. One woman comments, as I walk by, “His mother won't 
be pleased when she hears what he's been up to.” A man says, “There's that little 
tear-away.” I tell the player character to keep moving, but he's caught by a member 
of the town guard, operating under story logics. He accuses the player character of 
two crimes, for which he'd be fined if he were older: “Violent Conduct Towards a 
Person or Persons Without Guard's Permission” and “Accessory to Improper Man 
and Lady Behavior.” It seems the lovers” tryst wasn't a secret worth paying the 
player character to keep, if even the payment is known around town so quickly. 

I direct the player character up a path, where a bully is intimidating a smaller 
boy. I target the bully, using the same method used to target the small girl, and 
hit him. This time, because he's intended to be fought, a progress bar appears — 
indicating how close the player character is to beating him into submission. When 
the bar's length is accomplished, the NPC gives up and control is returned to the 
story logics. Under their direction he speaks a line before running away. Unlike the 
previous beating, this counts as a good deed, and will be rewarded with a gold piece 
if the player character returns to his father. Next, the smaller boy thanks the player 


character, and gives him the girl's lost bear to look after. 
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The player character now only needs one more remuneration-producing action 
before buying his sister's present. The fastest way to payment is to head into the 
thick of things, and Í steer him back to the area around the tree. The girl who has 
lost her bear is with the group of NPCs there, and giving the bear to her would count 
as a good deed. Also, the woman whose husband is cheating on her is there and, 
even though it requires breaking a promise, telling her about her husband's infidelity 
counts as a good deed as well. 

As I guide the player character to the area around the tree, everything basically 
looks fine. There are a number of NPCs of the same type — multiple village women 
and village men — but they have different hair styles, hair colors, and clothing. 
The characters sit down on the benches around the tree and get back up, walk one 
direction and then another, turn to talk with one another and then turn away, look 
up at the sky and then back down. Many characters take similar actions, but they 
do them in different parts of the screen, facing in different directions, and surrounded 
by different scenery. The impression of a complex visual field containing a circulating 
group of people emerges from the NPCs making simple random choices between a 
limited number of animations. 

At first things also work well on a linguistic level. A man, facing a woman, says, 
“You never can tell with that kid.” She replies, “I wonder what he'll do next.” 
It's a triumph for the world simulation logics, creating an impression of a living 
community of NPCs quite different from games like KotOR. Then the woman says, 
“There's something different about that boy.” The man replies, “You never can tell 


with that kid.” Here, we have repetition, with the man saying exactly what he said 
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a moment ago. And somehow it's more disconcerting, because it does not emerge (as 
the story logic repetitions do) from the player character prompting the NPC to speak 
again while the story logics remain in the same state. A moment later, the problem 
makes itself yet more strongly felt as two women (one with blonde hair, one with 
red) are speaking together. One says, “What a strange little boy he is!” The other 
replies, “What a strange little boy he is!” And the first woman's rejoinder is, “What 
a strange little boy he is!” Such repetition is a powerful technique for language — 
it can create the radical effects of Beckett or the traditional rhythms of folktale and 
chant — but none of these are among Fable's goals for its NPC dialogue. 

In other parts of the game, if I were to keep the player character going past 
this point, Pd find more successful circumstances. Individual characters would make 
appropriate comments as the player character passed, and accurately use his title 
(which, rather annoyingly, remains “Chicken Chaser” until another is purchased, even 
if he accomplishes deeds of some renown). The world simulation logics would also 
allow for gigeling, flirting, and other kinds of interaction with almost any NPC. And, 
of course, the alternation between story and world logics would work smoothly and 
impressively even for some characters who aren't gate keepers. For example, on the 
first quest of any length, the player character encounters the man who lives at Fisher 
Creek. This NPC speaks his story dialogue, responds appropriately to giggles, and 
makes small talk driven by world simulation logics when his story lines are exhausted. 

But, overall, the strongest impression made by Fable's NPCs operating under 
world simulation logics is of their moments of linguistic repetition. This problem, in 


turn, arises from a simple error that the simulation logics appear to have inherited 
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from the more established story logics: treating NPC animations and speech with the 


same logics. 


Speaking and moving in Fable 


In the case of NPCs controlled by Fable's story logics, a particular story event triggers 
a particular animation and speech. Either this event is not repeated, or it is repeated 
based on the player characters actions. Given this, a repeated story event can pro- 
duce repetitions of NPC animation and speech, but this is not especially problematic, 
given that the repetitions emerge from repetitive actions on the part of the player 
character. 

In the case of Fable's world simulation logics, NPC animation and speech are also 
triggered by the same logics, but in this case both logics are supported by processes 
that have an important random element. Characters decide whether to move and 
whether to speak through partially-random processes. If they have multiple move- 
ments they can make, the choice has a random element. If they have multiple things 
they can say about the player character, the choice of which has a random element. 
For choosing animations this is a fine logic, given all the visual variety in Fable. But 
it produces jarring linguistic repetitions all too frequently. 

In other words, what works for Fable*s world simulation as a graphical logic fails 
as a linguistic logic. And this is an issue more generally. System designers selecting 
an abstract process or lower-level logic to support a particular operational logic need 
to carefully consider (and test) whether the abstract process or lower-level logic ap- 


propriately treats the data with which that operational logic will operate. N-grams 
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performed with words can produce evocative strings of language, reminiscent of the 
human language use that provided the data files. N-grams performed with pixels do 
not result in pictures reminiscent of those providing the pixels, because the serial 
ordering at the root of the n-gram's abstract process is not a good match with how 
pixels function in images. 

In the case of Fable, one can't help but wonder if this mismatch could have been 
avoided by simple means. The problem might have been addressed, for example, by 
involving writers more in the design of the system that would perform their words. 
Writers, aware of the power of exact repetition in language, might have suggested 
an alternative logic as simple as “choose one not yet used this state of the quest” or 
“not used in the last 10 minutes of gameplay.” Of course, this might have required 
giving characters a nonverbal response to use as a fallback when all linguistic ones 
were temporarily exhausted, or dictated changing the characters” attitudes so that a 
new set of responses would become active. The consequences of such choices might be 
difficult to predict, but one suspects they would have had a greater chance of success. 

Of course, this would have only addressed the most glaring problem. Other, 
deeper, issues would have remained. For example, alternating between the control 
of story logics and world simulation logics is acceptable for animation, because a 
sequence of physical actions does not have to be strongly ordered for us to parse it as 
appropriate. On the other hand, during linguistic interaction ordering is crucial. The 
same statements, made in different orders, can produce either a sensible conversation 
or a series of non-sequitors. Because Fable's world simulation logics select speech acts 


with no reference to the larger linguistic context in which they are embedded, only 


193 


clever writing (of which Leach and his team managed quite a bit) can prevent every 
statement of NPCs controlled by the world logics from seeming a complete break with 
all interaction up to that point. Again, the problem is the assumption that graphical 
and linguistic behavior can be controlled by the same logics. 

Further, even within the story logics, which do maintain a certain linguistic order 
and context, there is no way within the current structure to get away from repetitive 
responses to player character actions. The problem is twofold. First, on an imple- 
mentation level, NPC dialogue is stored as pre-recorded sound files. These files take 
up a large amount of space, and so no game can include too large a number of them. 
In fact, Fable developer Dene Carter cites “early indications of the size of the audio 
files” as one of the key reasons that the original ambitions for Fable had to be scaled 
back (XBN, 2004). Just as with the large array size in Bennett's implementation of 
n-gram text generation, this implementation problem (which exists for many games) 
might be solved by a more clever implementation. For example, it might be possible 
to achieve greater file-size compression of the audio, so that each piece takes up less 
space. Or it might be possible to cut sound files into smaller pieces and then smoothly 
recombine them into a greater variety of utterances. Without such a solution it will 
be necessary to use repetitive responses because there is not room to store multiple 
responses to most events. 

But, even if one of these implementation improvements, or a different one, could 
overcome the problem with the size of audio files, another problem would remain. 
This problem has to do with the structure of the story logics themselves. In this 


structure, the story moves from state to state, and the current state determines the 
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response of the characters. So, for example, before the player character speaks with 
the little girl, the story logics are in one state, and so she gives a certain response 
when spoken to. Once this has happened, she gives another response when spoken 
to, because the first interaction changes the story logics” state. But speaking to her 
a third time does not change the story state, and neither does speaking with her for 
a fourth or fifth time — because each of these states would have to be individually 
authored and represented in the story structure: a process simply too laborious to 
be worthwhile. The same is true of hitting the girl. It changes the story state the 
first time, but not any time after that. And other variables must be excluded. For 
example, while 1t is possible to hit the girl within sight of the player character's father, 
he will not react to this action. For him to do so would require explicitly authoring 
a version of the story state in which that event involves him. 

However, Fable can be seen as on the track toward addressing problems such 
as these. Its world simulation logics are a first step. Characters currently have 
a representation of what they know about the player character that can determine 
their responses. A more detailed representation might allow the girl to be increasingly 
upset each time she is hit. A different approach to language (rather than pre-recorded 
files) might allow her to generate responses that reveal her changing feelings through 
what she says. And a different approach to action (rather than the limited positions 
of the story logics or the random choices of the world simulation logics) might allow 
her to run for help to the player character's father, or for the father to spontaneously 
come to her aid. And, in fact, people have been actively working on such richer 


models of character, story, and language for decades — under the rubrics of Artificial 
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Intelligence (AD) and Natural Language Generation (NLG), which will be major topics 
of discussion in later chapters of this study. 

These techniques are slowly finding their ways into computer games. The pace is 
slow, in part, because these approaches have problems of their own. In any case, until 
such techniques are mature and adopted, we will continue to have computer games 
in which all language is selected from a small number of fixed pieces of data, while 
visuals are created by blending many dynamic elements. In these circumstances, 


language and animation must be treated with different logics. 


Graphical and textual logics 


Fable is far from the only digital media project to treat language and imagery with 
the same logics. In fact, one of the most common approaches to text in digital media 
is to treat it with graphical logics. Projects like Dakota (Chapter 2, page 68) bring 
to text the logic of the motion picture (as in Brown's “readies”). Given that reading, 
music, and animation can all be rhythmic, serial events, the mapping can work well. 
Its result is a new experience of reading. 

Similarly, but in an interactive vein, projects like Camille Utterback and Romy 
Achituv's Text Rain (figure 4.3) invite and structure play with texts, but employing 
eraphical logics. Text Rain shows audience members a video image of themselves in 
an alternate reality. The letters of lines of poetry fall from above, coming to rest on 
anything darker than the background — usually part of an audience member's body. 
The letters can then be lifted, and let fall again, until they begin to fade as new letters 


fall. In other words, Text Rain makes the logic of collision detection something that 
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Figure 4.3: Lifting words in Text Rain 


allows an audience to play with text, together, using movement of their bodies. The 
text itself is about bodies, relationship, and language. 

The logic of collision detection is also central to the textual virtual reality piece 
Screen (figure 4.4) on which 1 collaborated with Sascha Becker, Josh Carroll, Robert 
Coover, Andrew McClain, and Shawn Greenlee. Screen was created in Brown Univer- 
sity's “Cave,” a room-sized virtual reality display. It begins as a reading and listening 
experience. Memory texts appear on the Cave's walls, surrounding the reader. Then 
words begin to come loose. T'he reader finds she can knock them back with her hand 
— collision detection — but words don't always return to where they came from, 
words can break apart, and peeling increases steadily. Eventually, when too many 
words are off the wall, everything remaining comes loose, swirls around the reader, 
and collapses — triggering a final text, read aloud but not seen. 


For each of these projects, particular textual data is employed that fits with the 
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Figure 4.4: Collapsing words in Screen 


particular use being made of graphical logics. If generic text were used, or if the 
projects were opened to arbitrary textual data, the results would be quite different. 
There is something unsatisfying about projects, quite a few of which exist, that 
present a set of implemented graphical logics to be filled in with arbitrary text (though 
the intention may be to present a challenge similar to the Oulipian projects discussed 
in the next section). 

The dissatisfaction doesn't occur in the same way when graphical data is used 
with graphical logics. Modifying graphical games by re-skinning them (replacing 
their graphical data) is more like providing new words for the textual logic of the 
crossword puzzle: the structure is designed for such replacement. 


Which brings us back to an issue we haven't yet considered from our earlier discus- 
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sion of n-gram projects. What concept and vocabulary should we use for describing 
works such as Travesty — implemented textual logics presented for use with arbitrary 
text? In creating Travesty, Kenner and O'Rourke clearly created some sort of work. 
Travesty is a complete system, its source code is published and widely distributed, 
it has a name, many have used it. Travesty is couched in literary terms, and writers 
such as Hartman have used it to produce literary work, but it seems not quite appro- 
priate to call it a work of literature — it comes with no data. It is a fully operational 
literary process, distributed with the intention that others supply appropriate data 
(linear sets of words), and with resulting surfaces that can range even for one person 
from Hartman's “childish pleasures” to his high literature. How should we think of 
such a thing? 

Though it may seem unlikely, in fact such a situation is not unique to digital 
literature. A group for which digital work is a secondary concern, a group that has 
been mentioned several times in previous chapters, has already thought through this 


issue in a manner that will prove useful. 


Potential Literature 


Raymond Queneau was a member of the Surrealist group — one of the many excom- 
municated during that group's tumultuous history. Years later, when he and Francois 
Le Lionnais founded the Oulipo (Ouvroir de Littérature Potentielle, or Workshop for 
Potential Lterature), the group decided to make this sort of occurrence impossible. 


The rules of the Oulipo forbid anyone from being expelled, as well as forbidding any- 
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one from resigning or ceasing to belong — a restriction that carries on past death, so 
that all deceased Oulipians are still counted among the current membership (though 


9 


they are excused from meetings).” At the same time, as Jacques Roubaud notes 


(1998), “lest the last rule seem unduly coercive”: 


One may relinquish membership of the Oulipo under the following cir- 
cumstances: suicide may be committed in the presence of an officer of 
the court, who then ascertains that, according to the Oulipian's explicit 
last wishes, his suicide was intended to release him from the Oulipo and 
restore his freedom of manoeuvre for the rest of eternity. (p. 38) 


It may in part be due to policies such as these, or the attitudes that inspired them, 
that the Oulipo continues operation today, more than 35 years after its founding — 
avoiding the infighting and dissolution that marked both Dada and Surrealism. The 
Oulipo's continued existence may also serve as a testament to the ongoing relevance 
of its basic work. That basic work, like its membership policies, may be most easily 
understood in contrast with Surrealism. The Surrealists — through techniques such 
as automatic writing — wanted to free the creativity of the unconscious by removing 
everything that stood in its way, including conscious thought about one's work. The 
Oulipo, on the other hand, propose methods of constraint and formal procedures that 
may assist in the production of conscious literature. Queneau's famous definition calls 


Oulipians “rats who build the labyrinth from which they plan to escape.” 





“The initial 25 members include the founders — Noél Arnaud, Jacques Bens, Claude Berge, 
Jacques Duchateau, Latis, Francois Le Lionnais, Jean Lescure, Raymond Queneau, Jean Queval, 
and Albert-Marie Schmidt — and the first 15 additions: Marcel Bénabou, André Blavier, Paul 
Braffort, Italo Calvino, Franois Caradec, Ross Chambers, Stanley Chapman, Marcel Duchamp, Luc 
Etienne, Paul Fournel, Jacques Jouet, Harry Mathews, Michele Métail, Georges Perec, Jacques 
Roubaud. 
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Most Oulipian proposals take the form of constraints on the work of writers. The 
Oulipian stance is that all writing is constrained writing, but most writers work within 
constraints that are traditional (e.g., the sonnet), so ingrained as to be almost invisible 
(e.g., the “realist” novel), or, perhaps worst, unknown to the writer (e.g., automatic 
writing). The Oulipo aim to supplement or replace these with other constraints 
of two sorts. One sort are traditional constraints, once largely forgotten, which are 
revived. The other sort are constraints new to literature, most of them adopted 
from mathematics (and mathematical games).*! Georges Perec was an Oulipian who 
excelled at the use of both sorts of constraint — for example writing a novel without 
the letter “e” (the “lipogram” is a constraint with a long tradition) and another novel 
structured by innovative application of “the Graeco-Latin bi-square, the Knight's 
Tour, and a permuting “schedule of obligations' ” (Mathews and Brotchie, 1998, 
170). 


Transformations and interactions 


In addition to their work on constraints for writing, there are also two other types 
of Oulipian proposals worth considering (though neither has as many examples as 
the work with constraints). One type are formal procedures for the transformation 


of text via substitution, reduction, and permutation (either of one's own text or of 





Roubaud quotes Queneau as writing, “The classical playwright who writes his tragedy observing 
a certain number of familiar rules is freer than the poet who writes that which comes into his head 
and who is the slave of other rules of which he is ignorant” (1986, 87). 


“The Oulipo's membership has always included both writers and mathematicians, only some of 
whom have practiced in both areas. 
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found text). The most famous Oulipian procedure of this sort, “N + 7,” I will discuss 
below. 

Another, which like “N + 7” operates by substitution, is the “chimera,” which 
one produces using four texts.*? From the first text the nouns, verbs, and adjectives 
are removed. These are then replaced with the nouns taken in order from the second 
text, the verbs from the third text, and the adjectives from the fourth text. Similarly, 
“definitional literature” replaces a text's verbs, nouns, adjectives, and adverbs with 
their dictionary definitions, and can be applied recursively (replacing those words 
with their definitions, and so on). 

The Oulipo's procedures for radical reduction through selective retainment are 
sometimes called “haikuization.” These include the reduction of poems to their 
rhymed line endings (as visited by Queneau on Mallaremé) and running together 
the first and last words of successive lines of a poem. The Oulipopo, a descendent of 
the Oulipo dedicated to the investigation of detective fiction, takes this to an extreme 
by proposing the reduction of detective novels to only their first and last sentences. 
“Slenderizing,” on the other hand, consists of the removal of all instances of a par- 
ticular letter throughout a text — a procedure not all texts can survive sensibly. 

Jean Lescure has provided two Oulipian procedures for permutation within one 
text, rather than substitution between texts. His method for found text encompasses 
four sub-types. In “plain” permutation the first noun changes place with the second, 


the second with the third, the third with the fourth, etc. In “alternate” permutation 





12The “chimera” and other constraints in this section are presented as described in the Oulipo 
Compendium (Mathews and Brotchie, 1998). 
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the first noun changes with the third, the second with the fourth, etc. In “bracketed” 
the first with the fourth, the second with the third, etc. In “Roussellian” (named 
for the sentence structure of Raymond Roussel's Nouvelles Impressions d'Afrique) 
the first with the last, the second with the next to last, etc. The “Lescurian word 
square,” on the other hand, is a method for composing text (and, in this sense, a 
kind of constraint) that generates a line of poetry for each possible permutation of 
the words written, employing a minimal number of connecting words (e.g., 24 lines 
from four words). 

In addition to these types of transformations, and the larger category of con- 
straints for writing, Oulipian practice also contains another type of proposal: struc- 
tures for audience interaction. 1 have already discussed one of these, Queneau's One 
Hundred Thousand Billion Poems, in some detail. A later chapter will look at two 
other Oulipian structures of this sort, those for branching stories and performances. 

The next step in this chapter will be short examinations of two Oulipian processes 
— one a writing constraint, the other a transformation procedure — in comparison 


with Surrealist and Dada processes described in the previous chapter. 


Larding 


The process of “larding,” also known as “line-stretcher's constraint” (after 19th Cen- 
tury writers who were paid by the line), is a very simple constraint. From a given 
text, pick two sentences. Then add another sentence in the interval between them. 
Then add another sentence in each of the two available intervals of the new text 


(between first and second, between second and third). Then add another sentence in 
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each of the four available intervals, and so on until the desired length is reached. 
Here is a short example that I constructed, beginning with a haikuization of 


Dashiell Hammett's The Dain Curse: 


It was a diamond all right, shining in the grass half a dozen feet from the 
blue brick wall. I had an idea he thought I hadn't a refining influence. 


It was a diamond all right, shining in the grass half a dozen feet from the 
blue brick wall. I didn't think he'd want me touching it. I had an idea he 
thought I hadn't a refining influence. 


It was a diamond all right, shining in the grass half a dozen feet from the 
blue brick wall. I started to crouch for it, but hesitated at the sight of 
Mr. Coburn rounding the corner in his black dinner jacket. I didn't think 
he'd want me touching it. 1 seemed to make his mustache wax soften just 
entering a room. 1 had an idea he thought I hadn% a refining influence. 


It was a diamond all right, shining in the grass half a dozen feet from 
the blue brick wall. Just to the left of my shoe. Í started to crouch for 
it, but hesitated at the sight of Mr. Coburn rounding the corner in his 
black dinner jacket. His frown was supported by deep lines in his cheeks. 
I didn't think he'd want me touching it. I don't think he'd want me 
touching anything on his face. 1 seemed to make his mustache wax soften 
just entering into a room. It was almost mutual — the wax's smell made 
my spit a little more sour. 1 had an idea he thought I hadn%t a refining 
influence. 


I have used “larding” as an exercise in writing workshops, a situation in which 
it can be practiced individually or collaboratively. In the collaborative version, one 
person chooses the initial sentence pair, the next inserts a sentence between them, 
the next inserts a sentence into each of the two available intervals, and so on. One 
thing that the collaborative version brings to the fore, strikingly, is the similarity of 
the form of this activity to that of the Surrealist “exquisite corpse” (see page 128). 


We are passing a piece of paper from person to person, each of us continuing — in 
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the space allotted — the writing of the person who preceded us, just as we would 
with the Surrealist activity. 

And yet the experience of writing text for “exquisite corpse” or “larding” is utterly 
different. The writer in the Surrealist process is blind to the text that will surround 
their contribution, and tempted toward a minor cleverness of the sort that Mad Libs 
inspires. Whereas the writer participating in the Oulipian process is working with 
open eyes — with perception of the shape of the overall text, and of the specific 
sentence gaps to be filled, creating the sense of seeking solutions to a literary puzzle 
that can be steered in many potentially correct directions. 

This seems to fit well with our understanding of the difference between Surrealism 
and Oulipo. One is interested in what will happen if we let go any attempt at control, 
if we deny ourselves the information or state of mind that might tempt us to try to 
control the situation, allowing us to think in new ways. The other is interested in 
setting a linguistic challenge of an unusual and interesting shape that will motivate 
us think in new ways. And yet this understanding seems difficult, at first, to fit with 


the workings of the Oulipo's transformational processes, such as “N + 7.” 


N+" 


This process (sometimes known as “S + 7”) is also quite simple. A text is transformed 
by substituting each noun with a different noun — the seventh noun after it in a 
given dictionary. Obviously, different dictionaries, and particularly dictionaries of 
different lengths, will produce different results. So, for example, the 1975 edition of 


the Random House College Dictionary produces: 
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To be or not to be: that is the quibble 


On some level this seems very much a “chance procedure” — of just the sort the 
Oulipo are avowed to oppose. The choice of dictionary is, presumably, going to be 
determined by random circumstance: what dictionaries are available. The use of the 
dictionary, itself, seems like an element of the process designed to foreclose conscious 
choice in the act of word replacement (the selection of new nouns could, instead, have 
been presented as a constrained writing exercise). Isn't this basically the Surrealist 
approach to writing? 

Not necessarily. But to understand why we'll need to revisit the specifics of 
Surrealist automatic writing (page 125), Tzara's newspaper poem (page 120), and 
the Oulipian opposition to randomness (page 112). 

Oulipian Claude Berge explained the difference between the Oulipo's concept of 
“potential” and the notion of “chance” as follows: “potentiality is uncertain, but not 
a matter of chance. We know perfectly well everything that can happen, but we don't 
know whether it will happen.” In the “N + 7” process one knows everything that 
can happen. It's a formally defined procedure, an algorithm, even if the specific data 
to be used is unknown. Whereas the Surrealist model of indeterminacy is to be open 
to anything, to the unforeseeable. 

Which returns us to Tzara's poem. We can see that this process, of drawing 
words from a sack, is also a formally defined procedure (even if a simple one, in 
which randomness is central). Florian Cramer, in Words Made Flesh (2005) explains 


the difference between a process like Tzara's newspaper poems and that of “true 
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indeterminacy” as the difference between “stochastic chance” and “philosophical- 


ontological chance”: 


Algorithms can be used for chance operations as Tzara's Dadaist poem 
shows, and — with much older origins probably in ancient India — the 
dice as a random computing device. But such random operations create 
stochastic chance, not philosophical-ontological chance. Throwing a die 
is a stochastic chance operation with the possible outcome of 1, 2, 3, 4, 
5, 6. Since these results are foreseeable as the set of potential results, 
they represent not an ontological, but a deterministic chance. Mallarmé 
describes it precisely in his sentence: “throwing the dice never abolishes 
chance.” Ontological chance, and therefore true indeterminacy, would 
occur if the die would crack, vanish, or show the number seven. (p. 78) 


Some of this we have previously discussed in relation to Espen Aarseth's work 
(page 111). But here, I think, we can use it to come to a more developed understand- 
ing of Oulipian procedures. These procedures contain many elements that appear to 
be based on chance (e.g., what dictionary is chosen for “N + 7”). But this is never 
philosophical-ontological chance, because the procedures are well defined. It is also 
not stochastic chance, because it is never a matter of rolling dice, flipping coins, or 
the equivalent. The elements of the process that are not predetermined are all arrived 
at through human choice. This can be the choice of the author, as in what dictionary 
to use for “N + 7.” It can also be the choice of the audience, as in Queneau's One 


Hundred Thousand Billion Poems.* 





l3Interestingly, this initial formulation makes Shannon's implementation of n-gram text gener- 
ation sound like an Oulipian procedure, whereas later computer implementations (which employ 
stochastic chance) are not. And yet Shannon's choices are understood to simulate stochastic chance 
— his paper even talks of the text he generates as “stochastic” approximations of English. From 
this we can see that, in order to achieve the “maximal motivation of the literary sign” that Motte 
discusses, the Oulipo need something more than arbitrary human choice, as the coming pages discuss. 
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On the other hand, it seems that choice is often being used as a source of inde- 
terminacy. As an author, 1 don't know what noun comes seventh after each of my 
text's nouns in a given dictionary. As an audience member, 1 don't know what the 
results of my choice will be in one of the Oulipo's branching narratives. 

But as an author, and perhaps also as an audience member, I may be missing 
the point of Oulipian processes if Il make my choices arbitrarily. A closer look at 
the relationship between processes such as “N + 7” and the larger body of Oulipian 


constraints will help us understand this. 


Transformation and constraint 


There is something strange about the relationship of Oulipian transformational pro- 
cedures, such “N + 7” and the “chimera,” to constraints such as “larding.” Further, 
compared with constraints such as the “lipogram” the relationship appears to ap- 
proach the paradoxical. After all, writing a French text without the letter “e” is an 
extremely demanding act of creativity. The procedures of “N + 7” or “chimera,” 
on the other hand, could be carried out by a trivially simple computer program (as- 
suming it could access part of speech information, either from human markup or a 
tagging program). 

Or is this really true? Certainly it's true that a human may find it a challenge 
to write many sentences of French without using “e.” But a computer could actually 
provide a large number of such sentences relatively easily, by a number of methods — 
including searching through large bodies of text (“corpora”) for examples, generating 


sentences in a customized way, and attempting “repair” of sentences with small “e” 
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counts. 

But this isn't the challenge of the “lipogram.” Rather, its challenge is to write 
a literary text while avoiding the use of one or more letters. We are impressed with 
Perec's La disparition and with Gilbert Adair's English translation (4 Void) because 
they manage to be interesting literary texts produced without use of the most common 
letter in their respective languages, texts which take the absence at their hearts both 
as a theme and as an expression of what the author of the text cannot say directly 
(in part, the loss of Perec's parents during WWIT). Undertaking such a task is almost 
unimaginably difficult for human authors, and operationalizing such an approach for 
a computer is outside the realm of the possible. 

Given this line of reasoning, we might wonder if there are other versions of Oulipian 
transformation procedures that might be more difficult to operationalize. And, in 
fact, there are. For example, there is a version of “N + 7” used for classical poetry, 
which requires the selection of the next noun (appearing in the dictionary seven or 
more nouns after the original) that allows for the retention of meter and rhyme. 
Here is Wordsworth, processed by Harry Matthews through the 1993 Langenscheidt 


Standard English-German Dictionary: 


I wandered lonely as a crowd 

That floats on high o'er valves and ills 
When all at once I saw a shroud, 

A hound, of golden imbeciles; 

Beside the lamp, beneath the bees, 
Fluttering and dancing in the cheese. 


But a computer program could certainly be made to understand enough about 
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meter and rhyme to carry out this more complicated version of “N + 7.” A more 
challenging transformation to operationalize would be that of “semo-definitional lit- 


” an alternative to the rote replacement of words with their dictionary defini- 


erature, 
tions. In this version the replacements are performed toward one of two goals listed 
in the Oulipo Compendium: the first, “that of orienting the derivation towards the 
style or the ideas of a particular writer or kind of writer” and the second, “that of 
demonstrating the lexical equivalence of sharply divergent statements” (Mathews and 
Brotchie, 1998, 222). The examples given include the transformation of a common 
truism into a passage from Ecclesiastes, and from there into a quotation from Simone 
de Beauvoir — as well as a demonstration of the equivalence of the phrases “A free 
society needs a free market” and “Workers of the world unite!” 

Certainly, with this, we have reached an Oulipian transformation procedure which 
it would be quite difficult to perform via automatic computation, one which also 
presents real challenges to human creativity. But this is only one procedure out of 
many, with most others quite easy to perform via computer. For this reason it seems 
unlikely that we have yet reached an explanation of what makes transformational 


procedures fit with the larger Oulipian enterprise. 


Creations that create 


The focus in this chapter?s discussion of Oulipian transformational procedures, up to 
this point, has been on the processes. But this is quite different from how we think of 
Oulipian writing constraints, such as the “lipogam.” The point of Oulipian writing 


constraints is that they constrain at the point of data. The constraints operate to 
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limit what data is considered acceptable, not to determine the writing process that a 
writer uses to produce that data. If we allow this fact to guide our thinking, we can 
see that the challenge of Oulipian transformation procedures, such as “N + 7,” also 
lies at the point of data. It is not that the procedures are a challenge to perform, but 
that it is a challenge to identify — to choose — texts for which these are interesting 
transformations. Again, this is a challenge for which, across the board, computational 
procedures would be poorly suited. Meeting this challenge as a human is not best 
accomplished by a chance act, but by one requiring thought and effort, and in this 
way it is quite different from Shannon's stochastic act of flipping to random pages. 

How, then, do we think of Oulipian transformation procedures? They are well- 
defined processes, which are asserted to support a literary logic, and which are pro- 
vided in the expectation that others will produce the data that will join with these 
processes to create works of literature. Oulipian publications (notably, in English, 
the Oulipo Compendium) provide short examples of these transformations in use with 
literary texts. These are the same sort of demonstrations, building the understand- 
ing that a process supports a literary logic, that appear in the articles that brought 
n-gram text generation to microcomputers. In fact, we can gain some further insight 
into projects like Travesty by considering the concept the Oulipo use to describe their 
transformational procedures: “potential literature.” 

Raymond Queneau, as translated by Warren Motte, says of the Oulipo, “We call 
potential literature the search for new forms and structures which may be used by 
writers in any way they see fit” (Arnaud, 1986, xi). Harry Matthews puts it quite 


clearly in the Compendium: 
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The last word of Ouvroir de littérature potentielle defines the specificity of 
the Oulipo. From its beginnings the group has insisted on the distinction 
between “created creations” (créations créés) and “creations that create” 
(créations créantes), to the benefit of the latter: it has been concerned 
not with literary works but with procedures and structures capable of 
producing them. When the first sonnet was written almost a thousand 
years ago, what counted most was not the poem itself but a new poten- 
tiality of future poems. Such potentialities are what the Oulipo invents 
or rediscovers. (p. 208) 


The Oulipo clearly has this work at the heart of it efforts. But this has not always 
been understood. After all, literary groups — of which the Oulipo is certainly one 
— generally produce texts, rather than structures and processes for others to use in 
creating texts. And certainly members of the Oulipo have produced remarkable texts 
using Oulipian procedures, as the novels of Perec, Calvino, and Matthews attest. 
But these procedures have also been used for literary works by non-Oulipians. Just 
as Perec made masterful use of the lipogram, so Gilbert Sorrentino, Christopher 
Middleton, and others have made remarkable use of “N + 7.” But we must not, 
when impressed by these examples of procedures in use, allow this to cloud our vision 
of Oulipian potential. As Oulipo scholar Mark Wolff puts it, “Writing is a derivative 
activity: the Oulipo pursue what we might call speculative or theoretical literature 
and leave the application of the constraints to practitioners who may (or may not) 
find their procedures useful” (2005). 

Oulipians have even debated whether any example, any demonstration, is nec- 
essary in asserting that a structure or process can should be regarded as potential 


literature. Jacques Roubaud (1998) writes: 


A debate within the Oulipo, dating from early on, bears witness to this: 
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for a proposed constraint to be deemed Oulipian, must there exist at least 
one text composed according to this constraint? Most Oulipians answer 
yes. But President Le Lionais, ever the radical, tended to brush this 
requirement away. (p. 41) 


Potential implementations 


The Oulipo do not distinguish between their transformational procedures and other 
forms of constraint — both, in presenting a structured challenge to the creation of 
literary work, are “potential literature.” Of course, there is something that distin- 
guishes between Oulipian potential literature and works such as Travesty: the implied 
implementation of Oulipian works of potential literature is “by hand,” whereas the 
processes of Travesty are carried out by digital computation.!* This is important for 
two reasons. 


First, there are some of the differences we have already discussed between pro- 





l4Though the Oulipo have used computers, this has never been the primary method of implemen- 
tation for their works of potential literature. Mark Wolff (2005) describes early Oulipian computer 
experiments this way: 


When the Oulipo formed in 1960, one of the first things they discussed was using 
computers to read and write literature. They communicated regularly with Dmitri 
Starynkevitch, a computer programmer who helped develop the IBM SEA CAB 500 
computer. The relatively small size and low cost of the SEA CAB 500 along with its 
high-level programming language PAF (Programmation Automatique des Formules) 
provided the Oulipo with a precursor to the personal computer. Starynkevitch pre- 
sented the Oulipo with an “imaginary” telephone directory composed of realistic names 
and numbers generated by his computer. He also programmed the machine to com- 
pose sonnets from Queneau's Cent mille milliards de poemes. In both cases the Oulipo 
was impressed but did not believe these computer applications had “potential”. What 
worried the Oulipo was the aleatory nature of computer-assisted artistic creation: they 
sought to avoid chance and automatisms over which the computer user had no control. 


Eventually a subsidiary of the Oulipo was created to pursue computerized methods: Alamo, or 
Atelier de Littérature assistée par la Mathématique et les Ordinateurs (Literature Workshop aided 
by Mathematics and Computers). 
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cesses carried out through human effort as opposed to automatic computation. An 
author, for example, can use Travesty to explore textual possibilities without un- 
derstanding how it operates. This would be impossible with a process carried out 
by hand — such as Shannon's implementation of n-gram text generation, or any 
of the Oulipian procedures. Also, processes carried out by automatic computation 
can conceivably be made to operate quickly enough to form part of the support for 
qualitatively new experiences, including interactive ones (such as textual dialogues). 

The second significance of Oulipian procedures being carried out by hand, how- 
ever, is more specific to the group. Carrying out processes by hand introduces the 
possibility of error, of unpredictability, of failure — deliberate or otherwise — to 
follow the constraint with machine-like exactitude. The Oulipo refer to this as the 
“clinamen,” the play in the system. Calvino has suggested, in reference to computer- 
assisted writing, that it is the “ “clinamen” which, alone, can make of the text a 
true work of art” (1986, 152). Some critics have argued that the clinamen restores 
the model of literary genius, a notion the Oulipo would like to view as Romantic 
or Surrealist, to Oulipian work. But the non-mechanistic need not come from the 
author to become part of the work. If there is a clinamen, for example, in many of 
the digital works inspired by the Oulipo, it is introduced by the readers — it is part 
of interaction (Wardrip-Fruin, 1996). 

Given these differences, and the potential for confusion, the name “potential liter- 
ature” should probably be reserved for Oulipian works. This leaves us to find another 
term that can encompass the full set of process-oriented literary works that are pro- 


vided without data (a category that overlaps with, but is not identical to, “potential 
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literature”). And, of course, it is particularly important for this study's purposes 
to find a name appropriate for process-intensive computational works such as Trau- 
esty. Given this, the candidate term employed for the remainder of this study will be 
“works of process,” though a better term may be found in time. The term will also 
be used in describing process-oriented Oulipian works of potential literature, as seen 


from this perspective. 


Reconsidering “Processes” 


Now, with the concept and term “works of process” in our active vocabulary, let us 
review the more complex terminology this chapter has developed as a compliment to 


the simple “surface, data, and process” triad used until now: 


Data is the collection of text, sound files, images, templates, tables of numbers, 
and/or other static assets employed by a process. For Shannon's n-grams, this 
is the book flipped through and read. For Tzara's newspaper poem it is the 
original article, to be cut into word-bearing scraps of paper. For Travesty it is 


the input file, for “N + 7” the starting text and dictionary to be used. 


Implemented processes are concrete realizations of abstract processes. Some im- 
plementations are through human effort, others through automatic computa- 
tion. Some implementations are more efficient than others, which can determine 
whether work of a particular sort is possible with the available resources (e.g., 
Bennett's technique was not efficient enough with memory to move beyond 4- 


gram text) and fast enough to be responsive (e.g., fast 3D rendering enables 
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fluid interactive navigation of virtual space). Implementation specifics may also 
alter the results of processes in ways that can appear profoundly different on 


the surface (e.g., Travesty versus Shannon's implementation and Hellbat). 


Abstract processes are specifications of how a process operates. For n-gram text 
generation, this is the selection of the next letter/word based on the previous 
n-1 letters/words, matched as a pattern in the data text. For “N + 7” this is 
the replacement of each noun in the starting text with the seventh noun after 
it in a dictionary. Almost any abstract process could be carried out through 
(“implemented in”) human effort as well as automatic computation, but for 
many contemporary works of digital media the calculations are so complex that 


they might never complete in a human lifetime of effort. 


Operational logics are elements of systems, supported by abstract processes, that 
are understood to embody an appropriate behavior of that system toward a 
particular end. Generally the understanding that an abstract process supports 
a particular logic is supported by a demonstration (one or more examples). 
Shannon's paper demonstrated that the Markov chain logic could operate as a 
linguistic logic, and later others demonstrated that this linguistic logic could 


operate as a literary logic. 


Works of process are literary works provided without data. Usually these have a 
name (e.g., Travesty or “N + 7”) and they also include one or more abstract 


processes that are asserted to support literary logics, as well as an implementa- 
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tion which may be explicit (e.g., source code or compiled computer application) 


or implied (e.g., Oulipian constraints are generally meant for solution by hand). 


Literary works are, in the case of generators, a combination of data and process that 
can produce many configurations — such as Strachey's love letter generator and 
Queneau's One Hundred Thousand Billion Poems. In the case of works that 
have only one configuration, such as Hartman's “Monologues of Soul and Body,” 


the work and its surface are less differentiated. 


Surfaces are the faces that works turn to their audiences (and perhaps toward other 


processes) as a result of their implemented processes working with their data. 


With this expanded set of concepts and vocabulary we're better able to understand 
the layers of contributions present in many works of digital literature, as well as the 
shifting circumstances that create the conditions of possibility for different works. 
This more detailed model will return throughout this study — for example, in the 


discussion of natural language generation techniques. 


Process variations 


Given the above set of concepts, we can briefly consider another permutation of 
literary process. Let us consider what happens when the data is stripped out of a 
work of process-oriented literature, in order to allow other data to take its place. We 
can see that, in the vocabulary above, this is the act of creating a work of process 
by the removal of elements from a work of literature. An example of this is Florian 


Cramer's (1998) reimplementation of Tzara's newspaper poem process as a web-based 
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CGlI script. The resulting web page performs a computationally-implemented version 
of Tzara's process (without a sack, a hand, or paper scraps of differing sizes) and 
allows the page's visitor to choose to use the text of a newspaper (from a pull down 
menu), the text of a particular web page (there is a form element for entering page 
addresses), or any body of text the visitor may write or paste in (there is a form 
element for entering text). Tzara's process becomes decoupled from the particular 
work, available for experimentation in a manner much like Travesty. 

However, this expanded vocabulary will not help us in all instances. For example, 
we might try to apply similar reasoning to the “computer program for generating 
surrealist “automatic writing' ” printed in Languages of Revolt (Hedges, 1983). This 
presents itself as producing surfaces similar to those created by Soupault and Bre- 
ton's process. We might presume, then, that this program would be supported by the 
same operational logics, if implemented somewhat differently. Recall, however, that 
Soupault and Breton's process is simply an attempt to get in touch with the writer's 
unconscious creativity and record the results — it is a transcription of data from a 
normally inaccessible source. How could this possibly be implemented computation- 
ally? The comments at the beginning of the program listing describe its operations 


as follows: 


This program models surrealist “automatic writing” by the controlled 
matching of randomly selected nouns, verbs and adjectives. Each clause 
may contain up to one “mis-match” of a noun to another noun (posses- 
sion), a noun to a verb, or a noun to an adjective. The continuity from 
clause to clause is usually maintained — by the use of pronoun replace- 
ment of a noun occurring in the previous clause, by conjunctions, and /or 
by semantic field linking between clauses. However, the program also al- 
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lows for breaks in discourse continuity, based on the percentage of such 
breaks observed in the surrealist text “Eclipses.” (p. 141, original in all 


caps) 

While the authors of this program (Inez K. Hedges, Aurthur E. Kunst, and David 
A. Smith, II) have perhaps produced an interesting example of random text gener- 
ation via the Spitbol programming language, this project's processes do not bear a 
resemblance to Soupault and Breton's at any level — implementation, abstract pro- 
cess, or operational logic. But this is a phenomenon well known in computer science, 
and which we will discuss further in later pages. After all, a computer system for 
scheduling air traffic does not need to follow the same process as humans faced with 
the same challenge, and yet can still produce results that are desirable. Similarly, the 
program by Hedges and collaborators can attempt to produce surface results that 
are similar to automatic writing while employing an unrelated process. From the 
point of view of an audience member, presented only with the text (and no context), 
the results may even appear indistinguishable. But from another point of view, the 
specifics of their data and process may reveal more about the nature of their project, 
and its informative differences from Surrealist undertakings, than reading through 
reams of surface output. 

Returning to the more faithful process adaptation of Cramer, it shows, as does 
the example of n-gram text's history, that the elements of literary processes do not 
follow any set path of development. An abstract process can be implemented and 
reimplemented. Á process can be presented as supporting one operational logic and 


another. An operational logic can be used to produce completed literary works, or 
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works of process, or lifted from one literary work to be employed in another. A literary 
work can be created from a work of process, and a literary work can be cleaved in 
two — data and process — in order to make a work of process with one half. These 
can take place at any point in this history of such work. 

This more complex view of literary processes is perhaps most easily grasped in 
the area of our discussion thus far: the textual, linguistic, and literary. In coming 
chapters we will consider a set of efforts of a different sort: ones that seek literary 
results (stories) from non-linguistic logics (such as “planning”). At the same time, 
we will also examine a wider variety of approaches to linguistic generation, not all 
of which have immediately apparent literary applications. Each will deepen our 


discussion of processes. 


Fictional Worlds and Words 


Symbolic Manipulations 


Up to this point we have mainly focused on language. N-grams arrange letters and 
words, Oulipian constraints and transformations operate at the level of words and 
phrases, and so on with Strachey's generator, Tzara's newspaper poems, and our 
other examples. In this chapter, however, we will begin to emphasize systems that 
operate a step removed from language — systems whose processes, at least in part, 
operate at what artificial intelligence practitioners refer to as a “symbolic” level. 
The prior chapter?s discussion of non-player character attitudes in Fable is the main 
example of such a system that we have considered thus far. 

Not coincidentally, this chapter will also begin our examination of systems for 
story generation. These systems focus not on stories as they are experienced by 
audiences (not short fictions, movies, or novels) but on story structures, apart from 
the language or imagery used to portray a fleshed-out version of that structure. This 
chapter will focus on the most famous story generator, James Meehan's Tale-Spin. 


We will look at the specifics of its structures and processes, the context in which it 
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was developed, and the ways in which it might be fruitfully approached in terms of 
fiction. As preparation for the last of these, we will look at the “possible worlds” 
theory of fiction, specifically as outlined by Marie-Laure Ryan. Here we will find a 
set of formulations that, by focusing on fictional worlds — rather than the language 
of fictional discourse, or even the events of fictional stories — help us identify where 
the real “action” is in Tale-Spin.' 

Unfortunately, Tale-Spin*s most interesting centers of activity don't end up being 
visible in the English text produced by its companion program, Mumble. The Mumble 
program is necessary because Tale-Spin does not operate through the manipulation 
of English words, but rather through characters, places, and world states captured in 
a set of symbolic expressions known as “conceptual dependencies.” Mumble operates 
to translate a set of Tale-Spin's conceptual dependencies into English, following a 
basic outline for natural language generation (NLG) that remains close to the current 
practice in the field. We will look at this standard NLG practice, which also carries 
out many of its operations at the level of symbol manipulation. 

Finally, we will revisit interpretations of Tale-Spin from three prominent scholars 
of digital literature: Jay David Bolter, Janet Murray, and Espen Aarseth. We will 
see how a closer examination of some aspects of Tale-Spin's operations might reshape 


their arguments in productive ways. 





l“Discourse,” in this chapter, is not being used in a Foucault-inspired sense, but rather in the 
sense (discussed further later) of the distinction between a fiction's “story and discourse” (its events 
and their portrayal). 


222 


Mis-Spun Tales 


As mentioned above, James Meehan's Tale-Spin (1976) is the most widely discussed 
story generator. Meehan's project holds a prominent position for a number of reasons: 
it was early, it was influential, and — perhaps most importantly — it has proven well- 
suited to making wider points about artificial intelligence. We might say that its most 


famous story is this one: 


Joe Bear was hungry. He asked Irving Bird where some honey was. Irving 
refused to tell him, so Joe offered to bring him a worm if he'd tell him 
where some honey was. Irving agreed. But Joe didn't know where any 
worms were, so he asked Irving, who refused to say. So Joe offered to 
bring him a worm if he'd tell him where a worm was. Irving agreed. But 
Joe didn't know where any worms were, so he asked Irving, who refused 
to say. So Joe offered to bring him a worm if he'd tell him where a worm 
was.... (p. 95) 

This story has been reprinted by a variety of authors, from Roger Schank and 
Peter Childers (in The Cognitive Computer, 1984, 85) to Janet Murray (in Hamlet 
on the Holodeck, 1997, 200) and Espen Aarseth (in Cybertezt, 1997, 131). We might 
point to this wide reprinting, if we wanted to argue that this is Tale-Spin's most 
famous story. 

On the other hand, there is a reason for arguing that the story above is not 
Tale-Spin's most famous tale: it is, put simply, not actually an output from Tale- 
Spin. Instead it is, as Meehan says, a “hand transcription” (p. 93) into English, 
presumably performed by Meehan, of an internal system state (originally represented 


as a set of “conceptual dependency” expressions — on which more later). Further, 


it was produced early in the creation of Tale-Spin, before the system was complete. 
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To publish this as one of Tale-Spin's tales is akin to printing a flawed photograph 
taken with a prototype camera, while it still has light leaks, and using this to judge 
the camera's function. The point of this output was to help Meehan develop his 
system, just as the point of such a photograph would be to help the designers find 
flaws in their camera before it goes to mass manufacture. Surely it makes little sense 
to represent this as one of Tale-Spin's tales. 

But even if we take this as a given, we are still left with a quandary. How do we 
regard this story? More generally, how do we regard the overall group of “mis-spun 
tales” from the Tale-Spin project, hand transcribed for Meehan's dissertation? Jay 
David Bolter (who, in his 1991 book Writing Space, also only reprints mis-spun tales) 
is correct in noting that, on some level, these are “far more interesting” (p. 180) than 
the system's real outputs. Aarseth agrees, asserting that these are Tale-Spin's “real 
successes.” He writes, “They are charming, funny, (unintentionally) ironic; and (the 
final proof of literary merit) they are the ones that are reproduced, circulated, and 
remembered” (p. 131). 

If we accept this, the next question is why these mis-spun tales are “far more 
interesting” than actual output from the Tale-Spin system. Is Aarseth correct in 
citing charm, humor, and irony? Certainly, at a simply textual level, there are far 


better sources of all three. 1 am convinced by a different explanation. 


Well-spun versus mis-spun 


If we are wondering why Tale-Spin*s mis-spun stories are considered its most interest- 


ing products, we might begin by looking at an example of an actual system output, of 
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the sort that Bolter and Aarseth describe as less interesting. Tale-Spin didn't create 
English-language output, but Meehan did write a simple program, called Mumble, to 
turn Tale-Spin story structures into English. Here is a two-stage output from the 


completed Tale-Spin, via Mumble, that appears in Meehan's dissertation: 


Once upon a time George Ant lived near a patch of ground. There was 
a nest in an ash tree. Wilma Bird lived in the nest. There was some 
water in a river. Wilma knew that the water was in the river. George 
knew that the water was in the river. One day Wilma was very thirsty. 
Wilma wanted to get near some water. Wilma flew from her nest across 
a meadow through a valley to the river. Wilma drank the water. Wilma 
was not thirsty. 


George was very thirsty. George wanted to get near some water. George 
walked from his patch of ground across the meadow through the valley 
to a river bank. George fell into the water. George wanted to get near 
the valley. George couldn't get near the valley. George wanted to get 
near the meadow. George couldn't get near the meadow. Wilma wanted 
George to get near the meadow. Wilma wanted to get near George. Wilma 
grabbed George with her claw. Wilma took George from the river through 
the valley to the meadow. George was devoted to Wilma. George owed 
everything to Wilma. Wilma let go of George. George fell to the meadow. 
The End. (p. 164-165, original in all caps) 


Now, here are two mis-spun tales from similar scenarios: 


Henry Ant was thirsty. He walked over to the river bank where his good 
friend Bill Bird was sitting. Henry slipped and fell in the river. He was 
unable to call for help. He drowned. 


Henry Ant was thirsty. He walked over to the river bank where his good 
friend Bill Bird was sitting. Henry slipped and fell in the river. Gravity 
drowned. (p. 94-95)? 





21 have removed a parenthetical explanation of Meehan's from the second of these quotations. 
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All three of these stories are quite strange. But they're not strange in the same 
way. The first story — the successful story, from Tale-Spin's perspective — might 
make us ask, “Why is this language so stilted?” or “Why are these details included?” 
or “What is the point of this story?” The second and third story — the mis-spun 
stories — on the other hand, make us ask questions like, “Why didn't his “good friend” 
save Henry?” or “How is it possible that 'gravity drowned”*”?” 

To put it another way, the first example makes us wonder about the telling of the 
story, while the second and third make us wonder how such a story could come to 
be. We see this reflected in the contexts in which mis-spun tales have been discussed. 
Schank quotes them in the course of a discussion of the difficulties of debugging ar- 
tificial intelligence projects. Murray?'s quotation is in the course of a section on plot 
structures for computer generated narratives. Bolter's is embedded in a discussion of 
the relationship between programmer, computer, and audience when artificial intelli- 
gence is considered as a type of writing. Aarseth's comes in the course of an argument 
that it is unwise to ask machine narrators to simulate human ones. 

In short, each of these authors reprint mis-spun tales in the course of talking about 
systems and structures. And 1 believe that points us toward a more profound reason 
why Tale-Spin's errors are more interesting than its successes: it is only the errors 
that give us an insight into the system.* When Joe Bear repeatedly decides to bring 
Irving Bird a worm so that Irving will tell him where a worm is, we see that there 


must be a mechanism of a certain sort, which has gone awry. A successful output, 





3Wendy Chun has observed that the practice of gaining insight into processes through examining 
errors is well-established. We can trace it through studies ranging from gene function to aphasia to 
slips of the tongue. 
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on the other hand, gives us no insight into the mechanism — denying us not only an 
example of the inherently interesting theme of well-laid plans going wrong, but also 
any insights into what shape those plans assumed. 

I believe that the intuition one can form while looking at Tale-Spin's errors — 
that it is the operations of the system that are most interesting about the project, 
rather than the outputs which mask these operations — is true not only of Tale-Spin, 
but of most projects in the field of story generation. And, as it turns out, by some 
principled definitions this need not keep such projects from being good examples 
of fiction. But to clarify why this is we will need to examine how some theorists 


y) 


understand the term “fiction,” especially in relation to work in digital media. 


Forms of Fiction 


A shorthand definition of fiction might be a listing of its most familiar forms: “novels 
and short stories.” But such a definition won't be much help to us if we are interested 
in thinking about emerging forms. At one point the novel was an innovative new form 
of fiction, and would have been left out of such a definition. Now there are various 
innovative digital forms of fiction emerging. In order to think about new forms, we 
need a more principled definition of fiction. 

In response to this, a number of digital media theorists have begun to work with 
definitions of fiction emerging from possible worlds theory and modal logic. These are 
among a wider group of literary approaches derived from philosophy and linguistics, 


and this particular strand began to establish its own identity in the 1970s. Marie- 
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Laure Ryan, in her Possible Worlds, Artificial Intelligence, and Narrative Theory 
(1992) traces the lineage of this work to the late-1970s essays of Lucia Vania, Umberto 
Eco, David Lewis, and Lubomir Dolezel — followed by the 1980s books of Doreen 
Maítre and Thomas Pavel.* If we look at the first chapter of Pavel's book (Fictional 
Worlds, 1986) he sketches a similar picture of Dolezel and Eco's contributions, while 
also noting the importance of Ryan's early essays on the topic. 

In philosophy the notion of possible worlds is traced back to Leibniz, whose po- 
sition is perhaps most widely remembered as that devoutly held by Dr. Pangloss in 
Voltaire's Candide. Leibniz argued that there are an infinity of possible worlds in 
the mind of God — and that one of them, the best possible, is the one in which 
we live. Leibniz's position is engaged with the most elevated concerns of philosophy 
and theology, but that is not the only place we find possible worlds. Closer to home, 
we imagine possible worlds when we speculate (“I wonder if the lunch special will 
be Lemongrass Tofu”), when we wish (“I hope Jane gets that promotion”), when 
we plan (“We'll get there before the doors open, so we have great seats”), and so 
on. Possible worlds, and modal logic more generally, are tools for thinking through 
versions of such questions: What is possible? What is impossible? What is necessary, 
or contingent? The work of Saul Kripke and others has, especially since the 1960s, 
returned philosophers to thinking through these topics, well after the time of Leibniz. 


A simple attempt at applying possible worlds theory to fiction might propose 





“Ryan specifically mentions these texts: Vania (“Les Mondes possible du texte,” 1977), Eco 
(“Possible Worlds and Text Pragmatics: “Un Drame bien parisien,' ” 1978), Lewis (“Truth in 
Fiction,” 1978), Dolezel (“Narrative Modalities,” 1976), Maítre (Literature and Possible Worlds, 
1983) and Pavel (Fictional Worlds, 1986). 
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that non-fiction texts refer to our world, the real world, while fictional texts refer 
to alternative possible worlds. However, as Ryan points out, there are a number 
of problems with this. For example, this formulation does not provide a way of 
distinguishing between fiction, errors, and lies — all are statements made in reference 
to alternative possible worlds. Also, at a more complex level, there is the problem 
that further alternative possible worlds are continually embedded into both fiction 
and non-fiction. For example, both fiction and non-fiction describe the unrealized 
wishes and plans of the people who appear in them. Ryan's definition responds to 
these issues by identifying a further element, beyond the creation of an alternative 


possible world, that is necessary for fiction. 


Ryan's definition 


Ryan considers the constituent move of fiction not simply the creation of an alterna- 
tive possible world but the recentering of discourse to that world — so that indexical 
terms such as “here” and “now” are understood to refer to the alternative possible 
world, and terms such as “actual” themselves become indexical. Further, for Ryan 
fiction not only creates an alternative possible world, but also a system of reality, a 
universe. This is necessary because the alternative world of a fiction may also have 
many alternative possible worlds emanating from it, and each of them may have fur- 
ther alternative possible worlds (as when one character speculates as to the plans of 
another character). 

Adding the author and audience into the equation, Ryan arrives at the following 


possible-world definition of fictionality: 
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1. There is only one AW [Actual World). 
2. The sender (author) of a text is always located in AW. 


3. Every text projects a universe. At the center of this universe is TAW 
[Textual Actual World). 


4. TTAW is offered as the accurate image of a world TRW [Textual Ref- 
erence World], which is assumed (really or in make-believe) to exist 
independently of TAW. 


5. Every text has an implied speaker (defined as the individual who 
fulfills the felicity conditions of the textual speech acts). The implied 
speaker of the text is always located in TRW. (p. 24-25) 


Of course, one might object that there are many fictions that do not project sta- 
ble, or even logically possible, worlds. Ryan is, of course, aware of this — reminding 
readers that genres such as theatre of the absurd and postmodern fiction may liber- 
ate their universes from principles such as noncontradiction, which form the basis for 
judging the possibility of worlds in modal logic. Instead, Ryan proposes an expanded 
set of accessibility relations that may exist between our actual world and a textual 
actual world. These are they, in order of decreasing stringency (drawing on pages 
32-33): identity of properties (common objects have same properties), identity of in- 
ventory (Actual World and Textual Actual World have same objects), compatibility 
of inventory (Textual Actual World has all Actual World objects, plus additions), 
chronological compatibility (Textual Actual World is not older than Actual World), 
physical compatibility (same natural laws), taxonomic compatibility (same species, 
with same properties — or, a more stringent version also covering manufactured 
goods), logical compatibility (Textual Actual World respects noncontradiction and 


the excluded middle), analytical compatibility (same analytical truths), and linguistic 
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compatibility (language in which Textual Actual World is described can be under- 
stood in Actual World). Ryan is also aware that some texts, such as Beckett's The 
Unnameable, have a radical lack of narrational authority, or by other means make the 
facts of the textual reference world inaccessible (or never describe the textual actual 
world, or make it undecidable which world presented by the text is the textual actual 
world, and so on). In these cases, Ryan views the accessibility relations as wholly or 


partially undecidable. 


Fiction and its depiction 


Of course, we do not normally think of a fiction as simply a possible world. We 
also think of the narrative that takes place in the fiction's world: the events, the 
imaginings, and the (causal) connections between them. But part of the point in 
applying possible worlds theory to fiction is to help distinguish between narrative 


and fiction. As Ryan puts it: 


Narrative, like fiction, is supported by truth-functional mimetic state- 
ments. Both are rooted in the declaration: once upon a time there was 
an x, such that f(x). In narrative, this statement may be uttered either 
from AW [Actual World] or from the actual world of a recentered system; 
but in fiction, recentering is mandatory. (p. 259) 
This distinction points toward the fact that we have non-fiction narratives (e.8., 
biographies). Equally useful, however, is the way that possible worlds theory helps us 
understand non-narrative fictions. Ryan identifies postmodern fictions as the most 


prominent examples of these for literary theorists, and for the most part non-narrative 


fictions are seen as a relatively small subset of fictional texts. For some theorists of 
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digital media, on the other hand, non-narrative fiction is seen as the primary category 
— as we will soon discuss. 

Before we turn to that discussion, however, it is also useful to consider another 
way that we normally think of fiction. We don't just experience the fiction's narrative 
or non-narrative — we also experience the particular way that the (non-)narrative 
is presented: the order in which elements are revealed to the audience, the specific 
language used, and so on. In literary theory this is sometimes referred to as the 
distinction between “story” and “discourse” (or, in some more specialized contexts, 
“fabula” and “sjuzhet”). 

One important aspect of a possible worlds approach to fiction is that it does not 
specify much about discourse.? The discourse could be pure text, as many literary 
theorists assume, or 1t could be a mixture of text with other media elements (sound, 
images), or even atextual. For Ryan the important point is whether the elements 
used to portray the world are in part drawn from an artist's imagination. She writes, 
“Comic strips are fictional because the characters are invented, and the artist merely 
pretends to have a visual and auditive source. Political cartoons represent real-world 
individuals, of whom the cartoonist may have a genuine visual source, but these 
individuals are placed in imaginary situations.... As for movies, they are nonfictional 
when the camera captures genuine events and fictional when the recorded events 


are simulated by actors” (p. 101). Ryan similarly differentiates between staging and 





6 


5Placing it among a set of “narratological” approaches that have inverted the “new critical” 
focus of literary scholarship on the specifics of language (and therefore on critical practices such as 
close reading). Narratologists, instead, focus on issues such as plot structures, the ordering of events 
(in the plot and in the telling), the different levels of narration present in a fiction, and so on. 
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pretending — a staged event is still a real event, as when a weaver practices her craft 
for the benefit of a documentary film crew. She is not pretending to weave. 

Of course, for theorists of digital media interested in forms ranging from email 
to video games, it is important to have a way of discussing fictions expressed both 
purely textually and by other means. This, again, becomes an attractive feature of 
the possible worlds approach to fiction. However, this broadening may also cause 
digital media theorists to regard “fiction” as an aspect of their object of study, rather 


than as a way of naming the type of object. 


Digital possible worlds 


Jill Walker is a digital media theorist interested in many forms of fiction with a 
strong textual component, including email narratives and weblog fictions, as well 
as some without text as a primary component. Her Fiction and Interaction: How 
clicking a mouse can make you part of a fictional world (2003) is an examination 
of interactive digital fictions that somehow include the user as a character in their 
alternative possible world. She introduces the term “ontological interaction” in the 
course of describing how users are included in these worlds, building particularly on 
the notion from Thomas Pavel's book of “ontological fusion” (between our actual 
selves and fictional selves when engaging with fictional worlds) and Kendall Walton's 
theory of how we use fictional representations as props in a game of make believe. 
(She cites Walton's 1990 Mimesis as Make-Believe, though earlier formulations of 
Walton's thinking are cited in both Pavel and Ryan's books.) Walker notes that this 


approach provides a vocabulary for discussing the common generation of fictional 
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experience in works “as disparate as installation artworks, interactive narratives and 
computer games” (p. 31). However, it also causes her to abandon the use of “fiction” 


as a name for a type of object. She writes: 


Fiction, in my view, is not a genre. It belongs to a different class of 
concept than game, image, narrative, novel, poetry or concept art. These 
are formal genres, which we classify according to their formal qualities. 
Fiction is not an object, it is a process, a fantasy emerging from the 
meeting of user and work. (p. 18) 


In a related vein, Jesper Juul's Half- Real (2005), quoted in an earlier chapter, takes 
computer games as the primary objects of study. Given this, Juul is not interested 
in “fiction” as a category of artworks, but as something that is an element of many 
games. He is interested in understanding how games project fictional worlds: what 
kinds of worlds, how players are cued to imagine them, and how those worlds relate 
to the games” rules. Juul notes that game fictions, like those of traditional literature, 
are incomplete. Just as in Hamlet we know little of the world outside the castle 
and its immediate vicinity — but on some level assume it is embedded in a fully- 
detailed world partially filled in by knowledge from our own world and other texts — 
so in the game Half-Life we know little of the world outside the complex where it is 
set. However, unlike most literary fictions, the worlds of many games are, in Juul's 
terminology, “incoherent” (which is one of the things that limits Juul's interest in 
discussing games in terms of narrative, as opposed to fiction). These are worlds in 
which significant events take place that cannot be explained without discussing the 
game rules, such as the many games that feature multiple lives and extra lives without 


any element of the game fiction that points toward reincarnation. Games may also 
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have even looser connections with fictions, such as iconic meaning (the members of 
the royal court in a standard deck of cards) or the embedding of abstract games 
within a wider fictional frame (as in Wario Ware Inc., where the games are presented 
as having been created by fictional characters). Juul points to a number of ways that 
games cue players into imagining fictional worlds, including: graphics; sound; text; 
cut-scenes; game title, box, and manual; haptics; rules; player actions and time; and 
rumors (p. 133-138). 

What remains for Walker and Juul, who are not concerned with fiction as the 
name of a type of artwork, is fiction as a type of imagination in the mind of the 
audience — the imagination of a possible world. It is a departure from our everyday 
use of the term “fiction” that allows them to reach valuable insights about a variety 
of forms of digital media. 

In this study I will retain “fiction” as a noun, as an umbrella term for works that 
create possible worlds and recenter our perspective on them. At the same time, l will 
perform a broadening move of the sort important to Walker and Juul's arguments — 
one that might be seen as the inverse of those that support their work. 1 will explore 
the notion of “internal” fictions, produced by the operations of the system but only 


partially made accessible to the audience. 


Story generators as fiction 


An interesting feature of Ryan's book, for our purposes here, is that it contains both 
an important contribution to the theory of fiction via possible worlds and a careful 


consideration of a number of models from artificial intelligence. Included among this 
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latter group of topics is a chapter on story generation that discusses Tale-Spin and 
other projects. 

One might expect that, when Ryan's chapter on story generation opens with a 
discussion of Tale-Spin, one of the first orders of business would be to outline the 
relationship of this project to the book's version of the theory of possible worlds. 
However, just the opposite takes place. Ryan does not explain how Tale-Spin is the 
product of an author in the actual world, projecting a textual actual world (offered 
as the image of a textual reference world, in which its implied speaker is located), 
and surrounded by a universe of its own alternative possible worlds. Rather, she 
suggests an idea that is interesting, but on first inspection may seem unrelated: that 
we may judge story generation projects based on the criteria of creativity, aesthetic 
awareness, and understanding. 

None of these criteria are related to the communicative context in which Ryan's 
possible-worlds definition of fiction is situated. Instead, these are all focused on the 
internal representations and operations of the story generation systems. And, further, 
Ryan does not quote an output from any of the story generation systems she discusses, 
not even one of the mis-spun tales questionably attributed to Tale-Spin. 

What she does instead is quite intriguing. She uses the language from earlier in 
her book in the process of describing the operations of these systems. For example, 
she writes of the “automatic novel writer” created by Sheldon Klein and collaborators 
in the early 1970s (Klein, Oakley, Suurballe, and Ziesemer, 1971; Klein, Aeschlimann, 
Balsiger, Converse, Court, Foster, Lao, Oakley, and Smith, 1973; Klein, Aeschlimann, 


Appelbaum, Balsiger, Curtis, Foster, Kalish, Kamin, Lee, Price, and Salsieder, 1974): 
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The real breakthrough resides in the program's ability to represent the 
modal structure of the narrative universe. The simulative algorithm 
decomposes this universe into a plurality of worlds, and assigns each 
narrative proposition to a specific domain. The characters of a story 
are not only defined by their physical, objective properties, but also by 
their correct and incorrect beliefs, their system of affective values, their 
goals, plans, fears, and rules of behavior. The sum of these mental con- 
structs forms a self-enclosed subsystem within the global narrative uni- 
verse. Klein observes that the mechanisms that produce the story by ma- 
nipulating the data and rules of the global universes can easily be made 
to operate within the confines of a private domain. When a subsystem 
related to a particular character momentarily takes the place of the global 
semantic universe, the program's operations reflect the point of view of 
the character in question. By generating stories in the mind of characters, 
a program can simulate the mental processes of looking ahead, imagining, 
dreaming, and hallucinating. (p. 242) 


In other words, Ryan describes the system as generating a “global narrative uni- 
verse” that contains characters who project alternative possible worlds, and then 
as able to recenter temporarily to these alternative worlds. This sounds very much 
like Ryan's definition of fiction, except without the communicative context. It is as 
though the system is acting as its own audience (which is in some ways the point of 
Ryan's previously-mentioned criteria of “aesthetic awareness” and “understanding” ). 

Why doesn't Ryan simply talk about the system's output, making it possible to 
situate the discussion in a manner closer to her possible-worlds definition of fiction? 
One possible answer is found on page 240, where Ryan observes of the automatic 
novel writing system, “the program's knowledge of the narrative universe is far more 
extensive than the printed output suggests.” It may be that little of what interests 
her is visible in the system's outputs. But it is also important to note that, for all of 


the story generation systems that have emerged from an artificial intelligence context, 
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story and discourse are separated. Some systems, as noted above of Meehan's Tale- 
Spin, may have no inherent discourse capabilities at all — while others simply view 
discourse as a secondary concern. Within a computer science tradition, it is common 
to view the creation of the actual text of a story as a problem of “natural language 
generation” (NLG), and distinct from story generation (which is focused on story 
structure). 

So in focusing on the internal processes and configurations of story generators, 
Ryan is concentrating on the elements of story generation systems that are the main 
area of work for their creators. By finding the elements of possible worlds within 
these systems, Ryan gives us the sense that these systems create fiction. Following 
the implications of these two moves suggests a formulation never stated by Ryan: 
story generation systems can create fictions by simulating possible worlds internally, 
independently of what (or whether) output is produced. We might not view possible 
worlds internal to computer systems as full fictions — even if events take place in these 
worlds, they are story without the possibility of discourse, without the possibility of 
audience. But considering these “internal” fictions will give us insight into story 
generation systems that are not possible when only considering the outputs. 

This formulation could be seen as an inversion of the move that supports Walker 
and Juul's work, for which it is the audience's imaginings that is central (what we 
might call “external” fictions). But I do not believe that the two perspectives are 
incompatible. They are both suggestions of new ways that we might consider fiction 
that will help us interpret digital media artifacts. 


Of course, if we accept this suggestion, it means that in order to understand 
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the full picture of these fictions — story and discourse, internal state and audience 
experience — we will eventually need to consider not only story generation systems 
but also something of the NLG systems that (explicitly or implicitly) accompany 
them. This chapter will follow this approach, looking at Mumble and its context 


after discussing Tale-Spin. 


Tale-Spin 


The worlds of Tale-Spin, like those of Aesop's fables, are populated by anthropomor- 
phized animals who act out allegorical stories meant to illustrate human nature. The 
view of humanity animating Tale-Spin, however, is not Aesop's. Rather, it is a vision 
crafted by artificial intelligence researchers in the 1970s — particularly by linguist 
and computer scientist Roger Schank and psychologist Robert Abelson. 

Schank, who would later quote some of Tale-Spin's mis-spun tales in The Cognitive 
Computer, was the advisor for Meehan's dissertation work — of which Tale-Spin was 
the result. Schank and Abelson were at that time two of the primary proponents 
of what would come to be called a “scruffy” approach to artificial intelligence. As 
one might imagine, this was in contrast to “neat” researchers — who sought general 
approaches to intelligent behavior through logic-based paths to problem solving (such 
as theorem proving). Scruffy research, on the other hand, sought intelligent results in 
various domains (such as understanding or generating plans) by speculating as to the 
knowledge structures and behaviors humans use, creating formal (and computable) 


representations of these, and hand-coding versions of them for particular contexts 
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into computer systems. A number of significant story generation projects have grown 
out of this approach, and out of Schank's laboratory in particular. Tale-Spin's status 
as the first of these may be another reason for its relative fame. 

Tale-Spin's approach to fiction is to begin with a simulated world — a fictional 
world that isn't simply portrayed (as in a novel or movie) but actually made to oper- 
ate. When one part changes, or a new part is added, the rest of the world reacts ap- 
propriately — if it is something significant to the story, the story changes. The world 
is represented and operates according to Schank and Abelson's theories: the “concep- 
tual dependencies” mentioned earlier, a set of “delta-acts,”* a set of “sigma-states,” 
and other structures (which will be briefly explained below). Meehan describes it in 


these terms: 


Tale-Spin includes a simulator of the real world: Turn it on and watch all 
the people. The purpose of the simulator is to model rational behavior; 
the people are supposed to act like real people. (p. 80) 


However, Meehan does not argue that simply simulating the real world always 
creates interesting stories. Tale-Spin does include two modes based on experimen- 
tation with the simulator, which Meehan says are “intended to model someone who 
makes up a story on the fly” (p. 80). But it also includes a mode of storytelling that 
fixes the contents of the world in advance, in order to guarantee a good story. If the 
right characters and props are in place when the particular fictional world comes into 


existence, something interesting will happen when the simulation is carried out. 





SMeehan calls them “delta-acts,” while Abelson and Schank call them “deltacts” in their 1975 
publications (Schank, 1975b; Abelson, 1975). 1 will alternate, depending on whose work TI am 
referencing. 
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Elements of Tale-Spin's simulation 


A foundation of the Tale-Spin system is the notion of “conceptual dependency.” 
As Schank describes it in Conceptual Information Processing (1975a), Conceptual 
Dependency Theory grew out of computational linguistics. While computational 
linguistics deals with language, Schank and his collaborators felt the need for a repre- 
sentation of meaning, of semantics, that was independent of any particular language. 
Such a semantic representation could serve as an interlingua, so that the translation 
of a statement from any one language to another could be accomplished by translating 
that statement to and from the semantic representation. Such a representation could 
also serve as an internal meaning representation for Al projects, helping in processes 
such as paraphrasing and inference-making. 

As presented in Conceptual Information Processing, a conceptual dependency ex- 
pression is a set of conceptual relationships that follow certain rules of “conceptual 
syntax.” There are Picture Producers (PPs) which can carry out ACTs — the prim- 
itive actions from which all others are assumed to be built. There are only eleven of 
these ACTs. Pl briefly describe them, drawing on Schank's pages 41-44. First, these 
seven are the relatively concrete ones: PROPEL (apply force to), MOVE (move 
a body part), INGEST (take something inside an animate object), EXPEL (force 
something out of an animate object), GRASP (physically grasp an object), SPEAK 
(produce a sound), and ATTEND (direct sense organ toward, or focus it on, some- 
thing). Second, these four ACTs are more abstract: PTRANS (change the location 
of something), ATRANS (change some abstract relationship with respect to an ob- 


ject), MTRANS (transfer information — either within a person's memory or between 


241 


people), and MBUILD (create or combine thoughts). ACTs can also have objects 
that are PPs (in Schank's example, if John eats a frog, the frog is an object of his 
INGEST), have directions that are locations of PPs (the frog moves from somewhere 
to John's mouth), have instruments that are entire conceptualizations (John moving 
his hand to his mouth to get the frog there), and change the value of states (this 
probably makes John's health have a lower value, and certainly so for the frog). Con- 
ceptual dependencies can also represent facts about the state of the world, in addition 


to acts that take place in it. 


Sigma-states and delta-acts 


The world of Tale-Spin revolves around solving problems, particularly the problems 
represented by four sigma-states: SIGMA-HUNGER, SIGMA-THIRST, SIGMA- 
REST, and SIGMA-SEX. These, as one might imagine, represent being hungry, 
thirsty, tired, and — in the system's terminology — “horny,” respectively. At the 
time Tale-Spin was written, Schank argued that people understand most situations 
using “scripts.” A script is a well-understood set of possible actions, used for familiar 
situations. So, for example, we might address SIGMA-HUNGER with a script for 
cooking, or grocery stores, or restaurants, etc. 

But, as Meehan puts it, scripts are “so developed that they're uninteresting: not 
great story material” (p. 154). So the characters of Tale-Spin must always use the 
approach that Schank argued we fall back on when no appropriate script is avail- 


able: planning.” Schank characterizes planning this way in his (1975b) paper “Using 





"Schank's notions of scripts and plans were first developed in the context of story understanding, 
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Knowledge to Understand”: 


A Plan is the name of a desired action whose realization may be a simple 
action (la] conceptualization involving a primitive ACT). However, if it 
is realized that some state blocks the doing of that action, the plan may 
be translated into a deltact to change the state that impedes the desired 
ACT. Thus, a Plan has attached to it a group of deltacts with tests for 
selecting between them. (p. 119) 


So, for example, if — by unlikely circumstance — John forms a plan to eat when 
he already has a frog in front of his mouth, all he has to do is the primitive ACT 
of INGESTing the frog. But if John does not already have a frog in front of his 
mouth, the plan will be translated into a deltact with the goal of getting a frog (or 
perhaps some other food) in front of his mouth. He may choose one that involves 
moving some food there with his hand. If he does not have any food in his hand, 
getting some food there may become a state-change goal. This idea of deltacts (or, 
as Meehan calls them, “delta-acts”) comes from Abelson. He describes his work on 
them as an attempt to “develop a set of intention primitives as building blocks for 


plans. Each primitive is an act package causing a state change. Each state change 





rather than story generation. Schank echoes Meehan's evaluation of the relative interest value of 
stories about scripts versus those about plans in Inside Computer Understanding (1981): 


Many stories (particularly the more interesting ones) are about entirely new situations 
for which we have no available script. When we hear such stories we rely on our 
knowledge of how to plan out an action that will lead to the attainment of a goal. 


(p. 33) 


Schank and Meehan are not alone in linking art to the unscripted, the unexpected, and even the 
disorienting. Among artists and humanists interested in link-based hypertext its potential for such 
effects has been cited as one of its potential benefits, though in more technically-oriented circles 
disorientation was seen as one of hypertext's large potential problems. George Landow, in Hypertext 
3.0 (2005), has explored this difference through the work of writers such as Morse Peckham (author 
of Man's Rage for Chaos). 
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helps enable some later action, such that there is a chain or lattice of steps from the 
initial states to the goal state” (Abelson, 1975, 5). Schank embellished the idea of 
delacts with what he calls “planboxes.” These are specific approaches to achieving 
the state change. 

For Tale-Spin Meehan implemented three delta-acts which correspond to the prim- 
itive ACTs PTRANS, MTRANS, and ATRANS. The first, DELTA-PROX, is used 
when someone wants to change the state of something to make it near something 
else, which includes changing one's own state to be near something. (DELTA-NEG- 
PROX is used to get something away from somewhere.) DELTA-KNOW is used 
when someone wants to change the state of someone to know something. Meehan 
calls his procedure for the communicative version of this “PELL” and the procedure 
for wanting to find something out oneself “DKNOW.” Finally, DELTA-CONTROL 
is used when someone wants to acquire something. 

Given the nature of Tale-Spin, many of the planboxes of these delta-acts depend 
on somehow persuading another character to do something (e.g., we have seen Joe 
Bear try to convince Irving Bird to tell him where a worm is). These common 
planboxes are gathered together into a PERSUADE package, which includes: simply 
asking; linking the action to a goal the other character is presumed to have, and 
enabling the action; bargaining; and threatening. Like all planboxes, each of these 
has preconditions — a character does not ask another to simply tell her something 
if she believes that the other intensely dislikes her. Similarly, a character does not 


threaten a character she likes a lot. 
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An implemented world 


In order to build Tale-Spin, Meehan created concrete implementations of Schank and 
Abelson's abstract structures and processes. Tale-Spin is written in the program- 
ming language MLISP (and ALGOL-like language) which is translated to run in the 
LISP environment. Data files are either LISP code or the parenthesis-structured “S- 
expressions” processed by LISP. So in Tale-Spin the conceptual dependency “John is 


in New York” would be represented as: 


((ACTOR *JOHN* 18 (*LOC* VAL (*PROX* PART *NEWYORK?*)))) 
(p. 141) 


Tale-Spin also contains mechanisms for assertion and inference. Assertion adds 
conceptual dependencies to what is known about the world. So, at the outset of a 
story, it might be asserted that Joe is a bear, that Joe exists, that there is a cave, 
it is Joe's cave, and that Joe is in the cave. The inference mechanism would draw 
many further conclusions from this, including that Joe knows he is in his cave. Many 
other things, however, would still be unknown — such as whether Joe is hungry. 
In conceptual dependency expressions, hunger is (like most states, the exceptions 
being measurements like height) represented by a value between -10 and 10. SIGMA- 
HUNGER represents the goal of going from a negative value to a positive one by 
obtaining some food (DELTA-CONTROL) and eating it (INGEST), after which there 
are two inferences: the character is less hungry and the food is gone. This is the 
starting point for many Tale-Spin tales, as getting control of some food may be a 


very complicated process. 
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The assertion and inference mechanisms create, and record the changes of, ev- 
erything in a Tale-Spin world: characters, props, and landscape. Asserting an act 
by a character, for example, adds it to the world's history/memory, along with all 
its consequences (which are largely determined by inference). Facts about the world 
are also indexed based on which character believes them, with those indexed for the 
system being those that are true of the world. All this is represented as conceptual 
dependency expressions, and it is a subset of these that, after or during a run of Tale- 
Spin, are translated into English. This can take place automatically, via Mumble, or 
“by hand.” 

Now, with an understanding of these elements and their backgrounds, let's look 


at the Tale-Spin story world in action. 


Spinning tales 


Tale-Spin has three story generation modes. In the first two modes, the simulation 
tells a story that is based on information about the world requested from the audience. 
In mode 1 certain conceptual dependency expressions are sent to Mumble as they are 
generated, resulting in the story being told incrementally, interspersed with questions 
for the audience. In mode 2 the same questions are asked, but there is no Mumble 
output until all the necessary questions for the story have been asked. Both of these 
modes are described by Meehan as “intended to model someone who makes up a 
story on the fly.” Mode 3, on the other hand, is meant to create well-formed tales by 
“fixing” the world appropriately before the simulation begins. A number of Meehan's 


mis-spun tales originated with his attempts to fix the world to generate Aesop's “The 
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Ant and the Dove” and “The Fox and the Crow.” In this section 111 follow the outline 
of Meehan's chapter 11, in which he goes through the generation of a complete tale 


in a question-asking mode. 


Initiating the world 


The fictional world described in chapter 11 of Meehan's dissertation comes into exis- 
tence when Tale-Spin asks what characters to use for the story, and the reader chooses 
a bear and bird. Each choice results in a “picture procedure” that does more than 
assert the existence of the character. For example, that for the bear (by assertion and 
its effects) creates a bear, chooses the bear's gender, gives the bear a name, adds the 
bear's name to the list of story characters, chooses a height and weight for the bear, 
creates a cave, creates a mountain in which the cave exists, marks the cave as being 
the home of the bear, and lets both the bear and the system know that the bear is in 
the cave. In Meehan's story the bear is named Arthur, and by similar means a bird 
named George is created (along with a nest, a maple tree for the nest, and a meadow 
for the tree). Each character is added to the other's list of acquaintances, so they 
know that each other exist, and they each have an idea of the other's location (at 
first, an accurate idea). Already we have an alternative possible world — perhaps a 
fiction — but not yet any story within it. 

Next, the audience is asked whether any miscellaneous items should be created: 


” ( 


“berries,” “flower,” “river,” and “worm.” The audience chooses a worm, and it is 
created along with the ground where it is currently located. The audience is asked 


who knows about the worm, and chooses Arthur Bear. This knowledge is now added 
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to memory, along with all of its inferences. If, for example, worms were something 
bears ate, and Arthur was hungry, the inference mechanism might initiate a plan 
for him to eat it. However, in Tale-Spin's worlds bears do not eat worms. Birds, 
however, do. 

Tale-Spin now has enough information to ask who the main character of the story 


will be: 


THIS IS A STORY ABOUT ... 
1: GEORGE BIRD 2: ARTHUR BEAR 


The audience chooses Arthur, and then asked which sigma-state presents his prob- 
lem: 


HIS PROBLEM IS THAT HE IS ... 
1: HUNGRY 2: TIRED 3: THIRSTY 4: HORNY 


Hunger is chosen as his problem — and then the story, which is to say “the activity 
of planning,” begins. One may remember that Meehan rejected structuring Tale-Spin 
around scripts — though in Schank's lab that would have made the simulation seem 
more realistic — in favor of plans. The reason Meehan gave was that scripts are 
“not great story material,” and the implication of this is that plans are good story 


material. In fact, planning is the primary literary logic at work in Tale-Spin. 


Making a plan 


In the case of Arthur's problem, Tale-Spin does not actually assert that he is hungry. 


Rather, it asserts something closer to “Arthur knows that he is hungry” — and from 
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this the inference is that “Arthur knows that Arthur intends not to be hungry.” The 
name of the Tale-Spin procedure for satisfying hunger is S- HUNGER (for SIGMA- 
HUNGER). In Tale-Spin bears eat honey and berries, so S- HUNGER checks memory 
to see if Arthur knows that he owns some honey or berries, or knows where some honey 
or berries are. He doesn't, so S- HUNGER selects a bear food at random, in this case 
honey, and invokes DCONT (DELTA-CONTROL) to get some. 

DCONT forms the goal “Arthur has some honey” and starts checking the precon- 
ditions for its planboxes. First it checks to see if it is already true (it is not). Then 
it checks to see if this goal is already part of Arthur's goal structure. It's not, so it is 
added. 

Why does DCONT check to see if getting honey is already part of Arthur's goal 
structure? For a number of reasons. Most obviously, because if it is already his goal 
(or a subgoal toward a higher goal) then it makes little sense to add it. But another 
reason to check for the goal's existence is that Tale-Spin also keeps failed goals, and 
the reasons for their failure, as part of a character?s goal structure. Before this was 
added to the system it was easy to create mis-spun tales like the one quoted at the 
beginning of this chapter — Tale-Spin's best-known product: Joe Bear forming the 
goal of bringing Irving Bird a worm over and over. 

The first precondition for DCONT"s planboxes is knowing the location of the thing 
to be controlled. Since Arthur does not know where any honey is, DKNOW is called. 
DKNOW forms the goal “Arthur knows where some honey is.” It's not already 
true, and it's not yet part of the goal structure, so it is added. The first planbox 


of DKNOW is used if there is a standard method for finding the information: like 
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checking a watch or clock to learn the time. There isn't one of these for honey in 
Tale-Spin, so the next planbox of DKNOW is to see if there's someone who is an 
expert on these matters to ask. Bears are experts at honey, but Arthur's the only 
bear he knows about, so Tale-Spin moves to the next planbox, which is to use the 
methods of the PERSUADE package to convince a friend to tell you. A friend is 
someone that you think relates to you with a positive value for affection. Arthur's 
opinion of George Bird's opinion of him is unknown. If the audience says that Arthur 
doesn't think they're friends, Arthur will have only one DKNOW planbox left before 
he has to give up on the goal of having some honey: PERSUADing an agent to find 
out the answer and tell him. 

Relationship states and personality states are extensions to Schank and Abelson's 
work created by Meehan for his work on Tale-Spin (p. 30). Testing memory for a 
relationship state is done by a procedure that adds the state to memory if it isn't 
there. Since Tale-Spin is in the mode of asking the audience, it types: 


DOES ARTHUR BEAR THINK THAT GEORGE BIRD LIKES HIM? 


l: A LOT 2: A LITTLE 3: NOT MUCH 4: NOT AT 
ALL 


Tale-Spin can simultaneously maintain four different states that are similar to this 
in memory. The first, this one, is: Does Arthur think that George likes him? The 
others are: Does Arthur think that he likes George (i.e., does Arthur like George)? 


Does George think that Arthur likes him? Does George think that he likes Arthur 





8 According to page 37 of Meehan's dissertation. The description of this portion of chapter 11's 
story, on page 124, elides this step. 
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(i.e., does George like Arthur)? All are used in making different types of plans. 

The audience says that Arthur thinks George likes him a lot, so PERSUADE 
starts working through its planboxes with the goal of Arthur getting George to tell 
him where some honey is, in pursuit of the higher goal of getting some honey. The 
first planbox is to simply ask, but it has further preconditions. So Tale-Spin asks if 
Arthur feels deceptive toward George (the audience answers: not at all), if Arthur 
feels competitive toward George (not at all), if Arthur likes George (a lot), and if 
Arthur thinks that George is trying to deceive him (not at all). 

Finally, Arthur Bear has a plan to know where some honey is, so he can control 


it, so he can eat it, so he can be less hungry: he”ll ask George Bird. 


Speculations and lies 


From here things begin to go poorly for Arthur. It turns out that George is a deceptive 
bird, and he will deliberately tell Arthur about an alternative possible world that isn't 
the one of this Tale-Spin story. That is, he's going to lie, and Tale-Spin is going to 
create a parallel world structure to support this lie, representing the fact that Arthur 
believes it. 

First, Arthur wants to TELL George his request. TELL has two planboxes: do 
it yourself, or get an agent to do it. Arthur starts with the first one, which requires 
that he be near George. This is a DELTA-PROX problem, resulting in the formation 
of the goal “Arthur is near George,” and the appropriate planboxes of DPROX start 
going. Since Arthur knows where he is (the cave) and thinks he knows where George 


is (his nest), there's no need to start up DKNOW planboxes. DELTA-LINK then 
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creates more of the world, so that there's a connection between Arthur's mountain 
and George's meadow. Then DO-PTRANS moves Arthur along the ground (bears 
cannot fly) to near George's maple tree. The inferences from this include Arthur 
and George both knowing that Arthur has arrived there, and Arthur knowing that 
George is there. (Before these sorts of inferences, characters had to SPEAK to let 
others know of their presence. This created the mis-spun tale in which Henry Ant 
drowns while his friend Bill Bird sits unnoticing.?) Now that everyone knows Arthur 
and George are near each other, DO-MTRANS is called and Arthur asks George to 
tell him where some honey is, resulting in a call to the procedure for handling favors: 
REQUEST. 

Now there is reason to ask the audience how George views his relationship with 
Arthur. Tale-Spin asks if he likes Arthur (answer: a little), if he trusts Arthur (a 
little), if he thinks he dominates Arthur (not at all), if he feels indebted to Arthur 
(not at all), and if he thinks that Arthur trusts him (a little). From this George 
concludes (using Tale-Spin's inference mechanisms) that Arthur will believe him if 
he tells him where some honey is (a possible world), but this isn't enough to help him 
decide whether he'll be happier if he answers Arthur's question. We might say that 
George next “speculates” as to what will happen if he gives an answer and Arthur 
believes him (creating more possible worlds). He does this by making an inference 


that Arthur believes the honey is where he said it was, drawing four inferences from 





“The story in which “gravity drowns,” on the other hand, was created by having gravity as a 
character that performed a GRASP on Henry Ant and then PTRANSed them both into the river. 
Henry got rescued by his friend, but gravity had no way out. This was fixed by having gravity 
PROPEL Henry instead. 
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that, and then making the inferences from believing each of those four inferences. 
But none of the multi-level inferences he draws from Arthur having that belief point 
one way or the other — none of these possible worlds seems better or worse than any 
other, or better or worse than the current one. George was looking for an inference, 
from the perspective of one of these possible worlds within worlds, that said he was 
happy or sad, but did not find one. Then something very strange happens. 

Tale-Spin doesn't decide whether George will answer by random selection. Rather, 
it decides based on how kind George is. The audience responds that he is “somewhat” 
kind, so he decides to give Arthur and answer “out of the goodness of his little heart,” 
as Meehan puts it. But when George Bird calls DO-MTRANS, this motivation isn't 
communicated — and DO-MTRANS decides to ask the audience whether George feels 
deceptive toward Arthur. The answer is “a lot,” so George ends up lying to Arthur 
about the location of honey out of the goodness of his heart. This isn't a simulation of 
George thinking Arthur needs to diet, but a breakdown in the simulation — though 
Meehan passes over it without comment. 

In any case, when George lies to Arthur the entire structure of what he tells him 
is created in the course of the DO-MTRANS, but some of it only indexed to character 
memory, not the system's memory. This leads to the creation of Ivan Bee, who owns 
honey, which is in a beehive, which is in a redwood tree, which surrounded by some 
ground, which is part of a valley. The physical locations are indexed to the system 
memory: they're real. But at first the hive, bee, and honey are only in George's 
memory, and not believed. When he tells Arthur about them, the audience needs to 


be asked how much Arthur trusts George. The answer is “a little,” and the inference 
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maker determines that they are added to Arthur's beliefs about the world. 


Another trick 


Now Arthur thinks he knows where to find some honey. As Meehan puts it: 


DO-MTRANS returns to REQUEST which returns to the first call of 
DO-MTRANS which returns to TELL which returns to PERSUADE. 
The goal of the persuasion has been satisfied — Arthur now thinks he 
knows where there's some honey — so PERSUADE returns to DKNOW 
which returns to DCONT.... All this has gone toward achieving only the 
first precondition of DCONT, finding out where some honey is. (p. 134) 


Nothing goes well for Arthur from here. The audience is asked how he feels toward 
Ivan (real feelings, in Arthur's reference world, even if toward a fictitious being in this 
Tale-Spin world) and he decides to ask Ivan for some honey. Arthur travels to where 
he believes Ivan can be found, but when he gets there TELL fails its precondition 
tests — he's not “close” to Ivan, because Ivan's not there. Arthur infers that George 
has lied, and proceeds to distrust him, like him somewhat less, and believe George is 
trying to deceive him. However, Arthur hasn't stopped believing that Ivan exists, but 
only that he knows where Ivan exists. As it turns out, there's no one to ask where 
Ivan really is except George. 

So Arthur heads back to George's tree. Given that asking George where Ivan is 
still exists as a failed goal, he tries the next PERSUADE planbox: bargaining. He 
infers that George would probably like a worm, and the initial setup determined that 
Arthur knew the location of a worm. So Arthur offers George a worm if he will tell 


him where to find Ivan. George promises to do so (a possible world), while making 
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the decision to trick Arthur (a different possible world, which will only come to exist 
if triggered by the worm's arrival). After Arthur succeeds in getting the worm and 
bringing 1t to George, the bird eats it and makes fun of him. With this planbox 
having failed, the next is to threaten. But the audience, when asked, asserts that 
Arthur feels a little dominated by George, so he won't try it. 

Having failed to find a way to ask Ivan for honey directly, the next planbox of 
TELL moves Arthur to try to get someone else to ask Ivan to give him honey. He 
tries with George — there's no one else — but this fails even more pitifully than 
his last attempt. Things continue to unravel until Tale-Spin decides there's nothing 
more that Arthur knows to do to try to address his hunger. 

His options are particularly limited because there's no way for him to go looking 
for food — if he cannot convince someone else to tell him, there's no way to find out. 
And, as we remember from the set up for the story, no one knows about any bear 
food in this world. Between Arthur and George they only know about one worm. So 
the story could not have turned out any other way. As soon as the audience decided 
that hunger was Arthur's problem he was doomed. He made many plans, none of 


them had a chance of working, the end. 


The worlds of Tale-Spin 


In Italo Calvino's Invisible Cities (1974) two characters named Kublai Khan and 
Marco Polo sit in a garden. Polo tells the Khan — sometimes in words, sometimes 
through symbols, sometimes through the relation of pieces on a chessboard — of cities 


he has visited within the vast empire. Here are a few. In the middle of Fedora is a 
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metal building with a crystal globe in every room, each containing a model of the city 
as it might have been in a possible future, constructed at a different stage of its history. 
At every solstice and equinox, around the fires of the marketplace of Euphemia, there 
is trade not in goods but in memories. In Ersilia, the inhabitants stretch strings 
between all the houses — marking relationships of blood, of trade, authority, agency 
— until one can no longer pass, all but the strings are taken down, and Ersilia is built 
again elsewhere. Thekla is continually under construction, following the blueprint 
of the stars, while Andria already reflects the heavens precisely — in every street, 
building, job, and ceremony — but those who live there must carefully weigh each 
change to the city, given the changes it will produce in the heavens. Polo and the 
Khan each propose a model city, from which all others can be deduced. They look 
through atlas pages that contain not only all the cities of the Khan's empire, but all 
those that will one day come to exist (Paris, Mexico City), and all imaginary lands 
(Utopia, New Atlantis). 

It is not hard to picture Tale-Spin as an addition to this list of imaginary lands. 
It is the place made up of nothing but plans within plans within plans. The people 
have no emotions, except those that help select between possible plans. They have 
no memories, except of plans underway, plans that have failed, and the locations of 
things they may use in plans. And these locations — the very geographies of this 
imaginary place — come to exist only as needed by their plans. 

Like one of Calvino's cities, Tale-Spin is an alien place. And yet, each is alien 
because some element that we recognize of our own lives becomes the defining element, 


practically the only element, of the people and landscape. On some level we do trade 
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in memories like the inhabitants of Euphemia, clot free passage with networks of 
connection like the inhabitants of Ersilia, and, like the inhabitants of Tale-Spin, 
make Chinese boxes of plans within plans that at times obsess us so that nothing else 
seems to exist. 

But what is perhaps most strange to us about Tale-Spin now — three decades after 
its creation, and outside the context of Al research — is that it was not intended 
to seem strange in these ways. It was meant as an allegory, as a simulation, as a 
demonstration that real people act in a particular way: according to Schank and 
Abelson's theories. And as an operational allegory, rather than a static story, part 
of its goal was to produce stories of the sort we recognize, stories of the sort we tell 
ourselves. 

As part of this, Meehan experimented with some stories of human characters, 
rather than animals. In an appendix, Meehan provides the story of a person whose 
problem is SIGMA-SEX. Here the strangeness becomes so apparent that even Meehan 


cannot gloss over it, but his diagnosis is odd. Here are excerpts from the story: 


Once upon a time Joe Newton was in a chair. Maggie Smith was in a 
chair. Maggie knew that Joe was in the chair. One day Maggie was 
horny. Maggie loved Joe. Maggie wanted Joe to fool around with Maggie. 
Maggie was honest with Joe. Maggie wasn't competitive with Joe. Mag- 
gie thought that Joe loved her. Maggie thought that Joe was honest with 
her. Maggie wanted to ask Joe whether Joe would fool around with Mag- 
gie.... [She travels to where he is and asks him. His relationship with her 
(competition, honesty, etc.) is defined, he makes the necessary inferences, 
and agrees. They each travel to Joe's bed. Then...] Joe fooled around 
with Maggie. Joe became happier. Maggie became happier. Joe was not 
horny. Joe thought that Maggie was not horny. Joe was wiped out. Joe 
thought that Maggie was wiped out.... [Maggie makes all the same infer- 


257 


ences, and then, because she's wiped out, this results in SIGMA-REST, 
and... Maggie wanted to get near her bed. Maggie walked from Joe's 
bed across the bedroom down the hall via the stairs down the hall across 
the living room down the hall via the stairs down the hall down the hall 
through the valley down the hall across a bedroom to her bed. Maggie 
went to sleep. Joe went to sleep. The end. (p. 165-166, original in all 


caps) 

Meehan does not comment on the elaborate traveling included in the story — this 
is the sort of thing that could be fixed at the language generation end, with a more 
complicated Mumble. The specifics of what the characters decide to do are much 
closer to the heart of Tale-Spin than the level of detail at which they are reported. 
Given this, Meehan comments that “The least Joe could have done would be to let 
poor Maggie sleep in his bed” (p. 166) — as though the problem with the story lies 
in the design of SIGMA-SEX. 

We might like an instance of SIGMA-REST that results from a successful SIGMA- 
SEX to understand its context and prefer for the characters to sleep in the same place. 
But Tale-Spin does not work that way. This is similar to George Bird deciding to 
answer Arthur Bear out of kindness, and then the loss of that context resulting in his 
answer being an unkind lie. There aren't enough connections between the elements of 
Tale-Spin's planning procedures for this sort of problem to be resolved with a simple 
tweak to one element, like SIGMA-SEX. 

Further, the problems with Tale-Spin telling stories of love run much deeper than 
SIGMA-SEX. Consider the following statement from Meehan's dissertation in terms 


of our culture's stories of love — for example, any Hepburn and Tracy movie: 


“John loves Mary” is actually shorthand for “John believes that he loves 
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Mary” ... Pm not sure it means anything — in the technical sense — to 
say that John loves Mary but he doesn't believe that he does. If it does, 
it's very subtle. (p. 47) 


In fact, it is not subtle at all. It is a significant plot element of the majority of 
romantic novels, television shows, and movies produced each year. But from within 
Meehan's context his conclusion is perfectly rational. If John doesn't know that he 
loves Mary, then he cannot use that knowledge in formulating any conscious plans 
— and in Tale-Spin anything that isn't part of conscious planning might as well not 
exist. 

This blindness to all but planning — this assumption that planning is at the 
center of life — was far from unique to Meehan. It might be argued that Schank's 
lab was remarkable in its assumption that we not only make plans, but also follow 
scripts: that there are two things that made up intelligent behavior. But both plans 
and scripts assume a fully specified account of intelligent action that exists before the 
action takes place — which, in the broader cognitive science community, was basically 
the definition of a plan. And within this wider Al and cognitive science community, at 
the time of Meehan's work, the understanding and generation of plans was essentially 
the sole focus of work on intelligent action. Debate, as between “neat” and “scrufty” 
researchers, centered on what kind of planning to pursue, how to organize it, and 
so on — not on whether planning deserved its central place as a topic for attention. 
This was in part due to the field's technical commitments, and in part the legacy of 
a long tradition in the human sciences. Lucy Suchman, writing a decade later in her 


book Plans and Situated Actions (1987), put it this way: 
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The view, that purposeful action is determined by plans, is deeply rooted 
in the Western human sciences as the correct model of the rational actor. 
The logical form of plans makes them attractive for the purpose of con- 
structing a computational model of action, to the extent that for those 
fields devoted to what is now called cognitive science, the analysis and 
synthesis of plans effectively constitute the study of action. (p. ix-x) 

This view has, over the last few decades, come under widespread attack from 
both outside and within Al. As Suchman puts it, “Just as it would seem absurd to 
claim that a map in some strong sense controlled the traveler?s movements through 
the world, it is wrong to imagine plans as controlling action” (p. 189). As this has 
happened — and particularly as the mid-1970s theories of Schank, Abelson, and 
Meehan have moved into Al's disciplinary history — Tale-Spin has in some sense 
lost its status as a simulation. There's no one left who believes that it represents a 
simulation of how actual people behave the in the world. 

As this has taken place, Tale-Spin has become, 1 would argue, more interesting as 
a fiction. It can no longer be regarded as an accurate simulation of human planning 
behavior, with a layer of semi-successful storytelling on top of it. Rather, its entire 
set of operations is now revealed as an authored artifact — as an expression, through 
process and data, of the particular and idiosyncratic view of humanity that its author 
and a group of his compatriots once held. Once we see it this way, it becomes a new 
kind of fiction, particularly appreciable in two ways. First, it provides us a two-sided 
pleasure that we might name “alterity in the exaggerated familiar” — one that recalls 
the fictions of Calvino's Invisible Cities. At the same time, it also provides an insight, 
and cautionary tale, that helps us see the very act of simulation-building in a new 


light. A simulation of human behavior is always an encoding of the beliefs and biases 


260 


of its authors — it is never objective, it is always a fiction. 


Mumble and Natural Language Generation 


The problem of creating a readable story after a run of Tale-Spin is that of turning 
a set of conceptual dependency (CD) expressions — semantic representations — into 
sentences. Given Schank's vision of CD expressions as an interlingua, much work had 
already been done in his lab on converting them into readable text. At first Meehan 


tried using the program then in active use at Schank's lab: 


That program, called Babel, is currently three-quarters the size of Tale- 
Spin, and the two programs are too large to be run together on the system 
at Yale. Even running them separately and communicating via disk files 
proved too cumbersome. But more important, Tale-Spin and Babel were 
not terribly well suited to each other: Babel has many features, like au- 
tomatic paraphrasing, that Tale-Spin doesn't use, and Tale-Spin requires 
some constructions, like certain modals, that are not implemented in Ba- 
bel. So 1 abandoned the idea of using Babel and, instead, wrote Mumble. 
It's a quick 'n” dirty program, written in a day, and it [sic] many of its 
parts do not correspond to the way humans speak, but it produces ade- 
quate, if somewhat verbose, sentences. Best of all, it's one tenth the size 
of Babel. (p. 144) 


Here we see, again, the strong priority of story over discourse in Tale-Spin. Almost 
all of the project's resources — computationally, and in terms of Meehan's time — 
were dedicated to work on the production of stories. Meehan's preferred solution 
was to make the creation of surface text someone else's problem, by using a program 
written and kept up by others in his lab. When this proved impossible, he gave a day 


to a minimal solution. 
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Mumble is now long forgotten by most, including most who write about Tale-Spin. 
And CD expressions have fallen on hard times intellectually, as eventually happens 
with all dreams of a computable universal language.*% And yet it is interesting to 
note that the basic assumption of Mumble, Babel, and CD expressions remains the 
dominant one in computer science work on text generation: that the starting point 
should be abstract meaning representations, which are progressively refined (following 
rules of syntax) into grammatically correct sentences. 

Following this path requires somehow getting knowledge about language into the 
computer program. And the more we do this, the more what the program works with 
is removed from concrete instances of language use, and the more our work with the 
program departs from the work of writers. It is a pursuit that — from the point of 
view of fiction — is both exciting and somehow unsettling. And it represents one 
end of a continuum that also includes examples familiar from earlier chapters, such 


as Christopher Strachey's love letter generator. 


Structure-oriented NLG 


The goal of Natural Language Processing (NLP) is for computer programs to work 
with human utterances in natural languages, such as English. The subfield of Natural 
Language Generation (NLG) is focused on the production of linguistic communica- 


tions, rather than their interpretation or understanding. As mentioned earlier, if we 





Though CD expressions have lost their currency, in some sense the notion of a small set of 
primitive ACTSs is alive and well in the sets of “verbs” made available to the audience of many 
interactive systems. And, given that the dreams of a computable universal language stretch at least 
as far back as Leibniz, they seem likely to be with us for some time, in one form or another. 
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plan to produce these communications based on what we know about the structure of 
language, we need to somehow make that linguistic knowledge part of the computer 
program. While there are many possible approaches to this, we can roughly identify 
them as falling along a continuum between two extremes. 

One extreme is to simply write the chunks of text ourselves, so that these chunks 
embody our linguistic competence. At this extreme, the computer need not know 
anything about the content of what we have written, because the computer will never 
need to do anything but display one of these chunks at the appropriate moment. 
Many computer games function this way, with human-written chunks of text read 
aloud by voice actors and the resulting sound files triggered by the system when the 
appropriate moment is determined by non-linguistic processes within the game. 

The opposite extreme is to attempt to put all the linguistic competence into the 
program. Using artificial intelligence techniques, the software would determine (at 
the level of meaning) what messages need to be conveyed to the audience. Then, 
using general knowledge about human language, a body of knowledge about the 
specific language in which the messages are to be conveyed, topic-specific knowledge 
of how ideas in this domain are typically expressed, and some sort of mechanism for 
defining and choosing between the different options deducible from this knowledge, 
the software would produce a chunk of text fully customized for that situation. No 
trace of a message, except as abstract knowledge, would exist before it was assembled. 
No operational system such as this exists, because many non-trivial research questions 
would need to be answered before one could be constructed. At the time of Tale- 


Spin, those in Schank's lab saw their work in the context of this ambition — as 
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is apparent in Neil Goldman's contribution to Conceptual Information Processing 


(Goldman, 1975) which outlines the design of Babel. 


NLG templates 


Structure-oriented NLG systems fall between the extremes outlined above. The sim- 
plest systems, perhaps too simple to be considered true NLG, are template-driven 
systems. These systems have a structure, a template, in which certain pieces of con- 
tent are left to be determined (they are “open slots” in the template). Also, aspects 
of the template may vary in simple ways. 

The best-known template systems, in everyday life, are letter-generating systems. 
These are used for everything from broad-based political fundraising to very specif- 
ically individual (and yet consistently structured) professional communications of 
doctors and lawyers. These systems may simply fill in the name of the recipient, and 
send otherwise-identical letters to every address receiving a certain type of letter, or 
they may insert or omit a wide variety of paragraphs, sentences, and even words to 
match the data the system knows about a recipient. 

We can see from this short overview of template-driven systems that Strachey?”s 
love letter generator is a somewhat complicated example of this type. Its overall 
structure is of an opening slot, five sentence slots, and a closing slot. Each sentence 
slot is, in turn, occupied by one of two sentence templates, and the open slots of 
these are filled by words. The template structure can elide the boundary between 
two sentences if they are both of the second (“You are ...”) type. 


Queneau's Poems is of an even simpler template design. It has fourteen slots, 
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each of a different type. Then there are ten data items (lines) of each type. And 
that's all. This bears out our conclusion that Queneau's project achieves most of its 
structure through the careful construction of its data.?! 

As with the “chunks of text” approach, most of the linguistic structure in a 
template system comes from human authoring that is expressed as completed text, 
rather than as structures expressed in computable form. This makes template-driven 
systems easier to construct than more complicated NLG systems, but it also provides 
less flexibility. Changing a text's tense, for example, would probably be accomplished 
through providing an alternate version of the template. Many NLG systems of the 
more complex varieties, on the other hand, would have little trouble generating past 
or present tense messages based on encoded knowledge of how tense functions in the 


language in question. 


Departing from writing 


Moving away from template-driven approaches, into the area of “true” structure- 
oriented NLG, we are also moving further from writing. This is true in two senses. 
The information we supply to the system is further from writing. Also, given the 
difference in the information we supply, it becomes harder to use the techniques 
traditionally used by authors to shape the surface output of the system — revision 
becomes something quite different from traditional textual editing. These facts are 


likely part of the reason that NLG techniques more complicated than templates have 





lMad Libs are the simplest form of template, operating with even fewer constraints than simple 
letter generating (“mail merge”) systems. 
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rarely been used by writers. 

In their article “Building Applied Natural Language Generation Systems” (1997) 
Ehud Reiter and Robert Dale outline six basic kinds of activity for NLG systems, 
using the example of a system that answers questions about rail travel. Content 
determination is the process of getting the semantic input that the NLG system will 
turn into text, and creating a set of messages that will be used by the further steps (in 
the example: the next train on the route, when it leaves, and how many trains a day 
travel that route). Discourse planning structures the messages, usually into one of 
computer science's ubiquitous trees (in the example, the identity of the next train and 
its time of departure become the leaves of a “next train information” node — with 
an “elaboration” relation between them — which is linked to the “number of trains” 
information by a “sequence” relationship expressed at the root of the tree). Sentence 
aggregation, in turn, is the process of deciding which messages should be grouped 
into sentences, often leaning on the tree data (in the example, it is pointed out that 
the two leaves might be combined so that the next train's name and departure time 
would be in one sentence). Lericalization is the activity that determines the words and 
phrases that will be used to express particular concepts and relations. Lexicalization is 
particularly important for systems that output in multiple languages, but can also be 
a good place to explicitly provide variation (to prevent monotony in output) or make 
choices about word usage (in the example, it is suggested that depart is perhaps more 
formal than leave). Referring expression generation is in some ways closely related 
to lexicalization, in that it is the selection of words or phrases to refer to entities in 


the messages. However, it is more focused on context, particularly the context of text 
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generated thus far (in the example, this is the question of when expressions like “it,” 
“this train,” and “the Glasgow train” are appropriate for referring to a previously- 
mentioned train). Linguistic realization is the application of rules of grammar to 
form the output from the previous processes into text that is correct syntactically, 
morphologically, and orthographically. (In the example, the system produces the 


» 


sentence “There are 20 trains each day from Aberdeen to Glasgow.” The syntactic 
component of the realizer added “from” and “to” in order to mark the train's source 
and destination, the morphological component produced the plural “trains” from the 


root “train,” 


and the orthographic component capitalized the initial word and added 
a period at the end.) 

As a practical matter, NLG systems do not generally have six different compo- 
nents for these six different activities. Reiter and Dale suggest that the most com- 
mon architecture actually consists of a three stage pipeline: text planning (content 
determination and discourse planning), sentence planning (sentence aggregation, lex- 
icalization, and referring expression generation), and linguistic realization (syntactic, 
morphological, and orthographic processing). Reiter and Dale's account here is in 
agreement with that of other NLG authorities, such as Eduard Hovy, whose sum- 
mary in the The MIT Encyclopedia of the Cognitive Sciences names these stages 


”» 


using the almost-identical terms “text planning,” “sentence planning,” and “sentence 
realization” (Wilson and Keil, 1999). 
This overview underscores another likely reason that few writers — and, in fact, 


few digital literature practitioners from any background — have made use of tradi- 


tional NLG techniques. As in Tale-Spin's design, these systems are generally con- 
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structed from the assumption that messages precede language. Their architectures 
are developed so that well-defined messages can be “realized” as texts. And yet cre- 
ative writing, for many writers, is the practice of what could not be said any other 
way. Its messages do not precede its text, but emerge through the specifics of its 
language. 

Yet, even if CD expressions are no longer in vogue, itis hard to imagine a story gen- 
eration system such as Tale-Spin creating text by any other logic than this. Doesn't 
this work have to proceed from semantic representation to linguistic realization? 
And doesn't this movement have to take place through generically-defined processes 
and knowledge, rather than from any specific text written by humans? If not, how 
would the vast number of possibilities that arise in a simulated world, which are only 
available to the system on a symbolic/semantic level, be expressed in language? At- 
tempting it with a simple template system could result in an explosion of work for 
the template author from even a few additions or changes to what is to be expressed 
about the world simulation. 

Of course, in the previous chapter, we saw a different approach to text generation: 
that of n-grams. This produced text that was strongly based on concrete instances of 
human-written language, rather than abstract rules. And, in fact, n-grams are one of 
the most important tools used in another set of approaches to NLP, those known as 
“statistical” techniques. These, in turn, are connected to a wider set of innovations 


in statistically-based Al, and will be discussed further in the next chapter. 
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Mumbled stories 


Given this overview, we see that the approach used for Mumble remains, to this 
day, near the mainstream of work in NLG. More work on Mumble could no doubt 
have produced smoother language for Tale-Spin stories, and perhaps eliminated some 
needless detail. But we can also see from this overview that the big problem with 
Mumble”s stories cannot be laid at its doorstep. 

The big problem with the output from Mumble is that it doesn't include the most 
interesting structures developed in Tale-Spin's “internal” fictions. Take, for example, 
George Bird trying to decide how to answer Arthur Bear's request to tell him where 
to find honey (page 251). Meehan doesn't provide us with the Mumble output from 
chapter 11's story, but a story with a similar situation in it is reprinted in an appendix 
to Meehan's dissertation. Here is the Mumble output of a series of considerations and 
speculations that, given Tale-Spin's structures, is probably much like those in Arthur 


and George's story: 


Tom asked Wilma whether Wilma would tell Tom where there were some 
berries if Tom gave Wilma a worm. Wilma was inclined to lie to Tom. 
(p. 168) 


The empty space between those two sentences is undoubtedly one of the most 
interesting parts of this story, if only we could see it from the “interior” view of 
Tale-Spin's operations. But Mumble stories never contain accounts of characters” 
multi-level speculations, or elaborate considerations of potential plans. And, given 
the standard NLG architecture which turns semantic messages into sentences, we 


must assume that the reason for this lies with Tale-Spin. It appears that Mumble 
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never receives the CD expressions for these parts of the Tale-Spin story. 

Of course, it may be that Tale-Spin never sends these to Mumble because it 
no longer has them. It may be that Tale-Spin only keeps the CD expressions that 
describe its textual actual world (TAW) and one level of remove: what characters, 
what objects, what goals, what plans. It keeps a history of those that were previously 
true, and Mumble gets access to these (so that it can say, for example, “go back to 
sleep” rather than “go to sleep” if a CD expression in memory says the character 
has been asleep recently, p. 146). But Tale-Spin may not keep track of the various 
possible worlds that characters explored, through the inference mechanisms, in the 
process of deciding how to act. And, whether it does or not, the fact remains that 
Mumble never gets sent the CD expressions for where the real action is with Tale-Spin 


characters: speculating, concocting lies, imagining possible worlds. 


Re-Reading Tale-Spin 


While Tale-Spin is the most widely discussed story generation system, much previous 
scholarly work has not examined its specifics in any depth. Because this work has 
not been anchored by an engagement with the operations of Tale-Spin, the system 
has been seen in radically divergent terms — and used as an example in arguments 
that are, at times, simply contradictory. Let us revisit and evaluate a few of these 


arguments, with a more full understanding of Tale-Spin at our disposal. 
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Al as writing 


Jay David Bolter's Writing Space (1991) discusses Tale-Spin in the context of an 
argument that “artificial intelligence is simply another way to write with the com- 
puter” (p. 171). There are two ways this could be meant. On a metaphorical level, it 
could mean that the encoding of symbolic structures, such as those that drive Tale- 
Spin's simulation, should be viewed as a kind of authoring. Our analysis of Tale-Spin 
certainly supports this. Whether consciously or unconsciously, any simulation of hu- 
man behavior is the result of authorship, rather than an ideology-free encoding of 
objectively-determined rules. And something like this interpretation of artificial in- 
telligence as writing with the computer is certainly present in Bolter's argument. He 


writes: 


Artificial intelligence programmers, in particular, play God, in the sense 
that they build programs that are meant to act in a human fashion. Artifi- 
cial intelligence specialists are sometimes quite open about such a parallel. 
They see themselves as demigods as they set about to create intelligent 
and therefore animate machines. What they fail to realize is that they are 
demigods in exactly the same sense as authors of narrative fiction have 
always been demigods of the stories they write. (p. 191) 


However, there is also another potential meaning for Bolter's assertion that arti- 
ficial intelligence is a way to write with the computer: he could mean that Al should 
be understood as a field dedicated to producing text. This would be quite incorrect, 
as we have seen. It is not only incorrect for Tale-Spin in particular (which produced 
no text, depending on Mumble for this purpose) but also for Al as a whole — which 


overlaps, but is not subsumed by, the field of natural language generation. Unfortu- 
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nately, this incorrect meaning is also present in Bolter's argument. For example, he 
writes of Tale-Spin: “It is easy to see in the stories of Joe Bear that artificial intelli- 
gence is the art of making texts” (p. 180). The slippage between these two meanings 
of “artificial intelligence as writing” is continual throughout the argument, leading it 
astray at a number of points. While Bolter may in part be excused for this error by 
the almost overwhelming “linguistic turn” in academic thinking that still influenced 
much writing at the time of his book, the fact remains that he might easily have 
avoided the fallacious part of his argument (and strengthened the accurate part) by 


simply paying more attention to the actual operations of the Tale-Spin system. 


Missing characters 


Janet Murray discusses Tale-Spin in rather different terms in her book Hamlet on the 
Holodeck (1997). Tale-Spin is discussed in the context of Murray's argument that “for 
authors to create rich and satisfying stories that exploit the characteristic properties 
of digital environments ... [writers would need a concrete way to structure a coherent 
story not as a single sequence of events but as a multiform plot” (p. 185). We might 
expect, given this, for Murray to criticize Tale-Spin for organizing its operations at 
the level of character, rather than at the level of plot. Instead, however, Murray 
seems to assume that Tale-Spin does operate at the level of plot, and simply does so 
defectively. 

Murray reprints the famous mis-spun tale of Joe Bear forming the failed goal, over 
and over, of bringing Irving Bird a worm so that Irving will tell him where a worm 


is (see page 222). She precedes the reprinting by saying that “stories told from an 
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abstract representation of narrative patterns but without a writer's relish for specific 


material can be incoherent” (p. 200). After the story Murray writes: 


The program goes into a loop because it does not know enough about the 
world to give Joe Bear any better alternatives. The plot structure is too 
abstract to limit Joe Bear's actions to sequences that make sense. (p. 200) 


Actually, as discussed earlier, Tale-Spin looped because — at the partially-completed 
state it was in at the time this mis-spun tale was generated — its characters could 
reassert a goal that had already failed (see page 248). In fact, Joe Bear's problem had 
to happen at the character level — it could not happen at the level of “plot structure” 
— because Tale-Spin has no “abstract representation of narrative patterns” at all. 

This problem does not cause Murray?s argument to derail, by any means. Hers is 
mainly a speculative argument, about the sorts of experiences that might eventually 
be possible with interactive story systems. In fact, Murray spends many more pages 
on her imagined system for interactive stories set in the Casablanca world than she 
does on Tale-Spin or other actually implemented examples. 

No, rather than a derailing, Murray?s error simply leads to a missed opportunity. 
As the next chapter of her book demonstrates, she is very interested in systems 
that model the interior operations of fictional characters. And characters like Joe 
Bear and George Bird have quite complex interior operations, if one looks beyond 
the anemic events output by Mumble. For example, as just discussed, when Joe 
asked George about the location of some honey, George first “speculated” as to the 
results of a number of possible actions, looking for one that would bring him some 


advantage. Then, finding no advantage along any foreseeable path, George decided 
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what to do based on his personality, with the result being a relatively complex lie 
(which, among other things, resulted in the creation of whole new geographies in the 
simulated world). 

The Tale-Spin model is, of course, problematic — and the cognitive science ideas 
that inspired it are now abandoned. But, nevertheless, its operations provide much 
that deserves the attention of writers such as Murray. The complex character behavior 
Tale-Spin produced in the 1970s is much more likely than an imaginary Casablanca 


system to make convincing fodder for an argument such as Hamlet on the Holodeck's. 


Imagined narrators 


Espen Aarseth, in his book Cybertext (1997), calls Tale-Spin “a cybernetic fiction 
device that does not work” (p. 131). He concludes this based on a selection of its 
mis-spun tales — an approach that we saw, at the beginning of this chapter, is quite 
problematic. Aarseth does see fit to qualify his statement with the phrase “at least 
in the examples given here,” but the rhetoric of failure is important for the point he 
seeks to make. Tale-Spin is one of Aarseth's three primary examples for the argument 
that machine narrators should not be “forced to simulate” human narrators (p. 129). 
Tale-Spin is presented as a failed example of such simulation, with its mis-spun tales 
its only claim to interest. 

From the viewpoint of Al, Aarseth's is an exceedingly strange argument. As we 
will discuss in the next chapter, the primary critique of Tale-Spin in Al circles is 
precisely that it does not attempt to simulate a human narrator. Tale-Spin simu- 


lates characters — not narrators, not authors. This has been seen as a fundamental 
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mistake, and was outlined by writers such as Natalie Dehn (in widely-cited papers) 
more than 15 years before the publication of Aarseth's book (Dehn, 1981b,a). 
However, as with Murray, we can still follow Aarseth's argument even while ac- 
knowledging his mistake. When we see phrases such as “trying to create a surrogate 
author” we can substitute something like “ideological attachment to narrative ideals” 
(p. 141). This is because Aarseth is arguing against simulating human narrators only 
as a proxy. He's really arguing against focusing attention on attempts to use the 
computer to extend the pleasures of traditional fiction and drama. Instead, Aarseth 
seeks to turn our attention to literature based on such features as combinatorics, 
interaction, and play — on the new literary possibilities opened by the specifics of 


the networked computer. As Aarseth writes: 


To achieve interesting and worthwhile computer-generated literature, it is 
necessary to dispose of the poetics of narrative literature and to use the 
computer's potential for combination and world simulation in order to de- 
velop new genres that can be valued and used on their own terms. Instead 
of trying to create a surrogate author, efforts in computer-generated lit- 
erature should focus on the computer as a literary instrument: a machine 
for cybertext and ergodic literature.... [T]he computer as literary agent 
ultimately points beyond narrative and toward ergodic modes — dialogic 
forms of improvisation and free play... (p. 141) 


It is the puzzle of Tale-Spin that we can diagnose, here, a problem very much 
like Murray's. In Tale-Spin Aarseth has a great missed opportunity. However much 
Aarseth asserts the contrary, Tale-Spin is not trying to simulate a human narrator. 
In fact, the story structures it produces are almost never like those that a human sto- 
ryteller would produce. Instead, it produces strange structures of plans within plans 


within plans. It produces what we might call, from the possible worlds perspective, 
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“minimalist fictions” — made up almost entirely of possible worlds of planning, spec- 
ulation, lies, and so on (without redundant emotions, movements, even geographical 
locations). Tt is combinatory engine for spinning off possible worlds embodying an 
alien vision of humanity, driven by the temporary worldview of a research lab. In 
other words, Tale-Spin can be seen as an example of one of the types of literature for 
which Aarseth is calling. 

Aarseth's missed opportunity, combined with Murray's missed opportunity, helps 
us see something interesting. Tale-Spin, early as it was, stands at an important 
crossroads. If we choose to emphasize its continuities with traditional fiction and 
drama, via its characters, then it becomes a useful touchstone for views such as 
Murray's. If we choose to emphasize its complicated strangeness, its computational 
specificity, then it becomes an important early example for views such as Aarseth's. 
In either case, a close examination of the system's operations reveals something much 


more intriguing than either author assumed. 


Learning from Tale-Spin 


Having given our close attention to Tale-Spin, what lessons can we come away with? 

For one thing, we have seen how a system intended to simulate human behavior is 
inevitably an authored system. It cannot escape being ideological, because it cannot 
escape encoding a set of beliefs about human behavior. We can bring this example 
forward, to current simulations of human behavior — including computer games. For 


example, we can read the encoding of consumer culture in games like The Sims and 
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the encoded military behaviors of squad-based combat games. The knowledge gained 
from our examination of Tale-Spin will not even be dated, despite the fact that it is 
a project from three decades ago. In fact, a number of current games use techniques 
from the era of Tale-Spin. The game that won GameSpot's 2005 award for best Al 
was F.E.A.R. — a game featuring Al explicitly based on a model even earlier than 
Tale-Spin. Specifically, F.E.A.R. employs a model of planning based on the STRIPS 
(STanford Research Institute Problem Solver) system from 1970 (Orkin, 2006). 

At the same time, we can also turn our attention to simulations that make truth 
claims. The history of these, as it turns out, also stretches back to before Meehan's 
work. For example, the “Urban Dynamics” simulations of Jay Forrester, in the 1960s, 
have been presented as evidence that building low income housing helps to stagnate 
a City, while tearing down low income housing creates jobs and a rising standard 
of living for all of the city?s inhabitants. Some policy makers have been taken in 
by “evidence” such as that produced by Forrester?s simulations, despite the fact 
that the encoded processes determining system behavior have no greater scientific 
basis than those underlying Tale-Spin. By being a fiction, Tale-Spin helps make the 
inappropriateness of this kind of thinking more legible than it is in a scientific context. 

As a fiction, Tale-Spin also helps demonstrate that non-linguistic logics — such as 
planning — can be employed toward literary ends. The next two chapters discusses 
a number of story generation systems that take this further. These systems operate 
on complex arrangements of symbols that represent the structures of literary themes, 
encode methods of story transformation, and even explicitly (rather than implicitly) 


represent ideology. 
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Finally, Tale-Spin, as a fiction, also provides a lesson — or, perhaps, a warning 
— for those interested in creating process-intensive fictions. Meehan's work has been 
consistently underestimated, in literary circles, for three decades. This was not, 
primarily, because it wasn't an interesting contribution. Rather, it was because the 
most interesting character behaviors it produced were not visible on its surfaces. 
Even dedicated humanities scholars were repelled by the work's uninteresting surfaces, 
failing to unearth evidence of its more interesting operations — something that would 
have been impossible if, like Queneau's One Hundred Thousand Billion Poems, it 
had depended on a much simpler set of operations that could be carried out by the 
audience. Given the experience of these scholars, we can only assume that audience 
members less accustomed to research had a similar experience. In other words, Tale- 
Spin's fascinating operations were understood by very few beyond a small circle of AI 
researchers. Anyone interested in producing literary work with digital computation 


should consider this lesson deeply. 


Authorial and Statistical Intelligence 


Author Functions 


What is an author? Within computer science this question has been pursued in a 
manner somewhat different from that found in literary criticism. While literary critics 
may talk of an “author function” they do not literally mean a mathematical function 
that carries out the work we identify with authors. Al researchers, on the other hand, 
have been in search of precisely such a computable function. 

This chapter considers two rather different approaches to authoring. One is a 
continuation of the symbolic, scrufty Al approach that underlies Tale-Spin. It builds 
on a combination of high-level ideas from the cognitive sciences and implementation- 
focused techniques from computer science. Like Tale-Spin, the two systems consid- 
ered here (Minstrel and Universe) seem to us, now, quite human-authored artifacts. 
The systems?” creators, however, sought to focus attention on a different author: one 
represented within the systems themselves. 

The other approach to authoring considered in this chapter is a significantly dif- 


ferent one. Rather than building the sort of symbolic structures found in Tale-Spin, 
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this kind of AI — statistical Al — is created using structures of the sort we saw in 
Travesty and other n-gram experiments. While statistical AI has become a nearly 
unqualified success in applications such as web search engines, other areas, such as 
natural language generation, are still strugeling with the potential of these techniques. 
Statistical Al is less legible in its operations, less predictable in its mappings between 
process and surface, and in other ways significantly less authorable than symbolic Al. 
At the same time, even though they are based on mathematics, statistical techniques 
should not be misunderstood as objective, or considered inscrutable from the perspec- 
tive of the arts, humanities, or social sciences. Every implementation of a statistical 
technique must select the features that will be employed in its model; the elements of 
models are weighted and smoothed based on authorial assumptions; statistical tech- 
niques are often combined with more structure-oriented techniques — and, in short, 
the hand of the author still makes itself felt in many ways. 

At this moment, statistical Al is ascendent, and other alternative Al formula- 
tions are flourishing, while symbolic Al is struggling. Despite this, certain beliefs 
and practices in the field that we identify with symbolic Al are not easily uprooted 
— continuing even in the critique-motivated work of more recent Al practitioners. 
In particular, humans as authors of Al systems continue to build their systems in 
humanity's image. The chapter closes with a short consideration of Al's persistent 
need to anthropomorphize its processes and asks whether, when our goals are literary, 


there is any alternative to this approach. 
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Minstrel and Universe 


While the previous chapter's treatment of Tale-Spin positioned it as strange in its 
blindness to all but planning, for understandable reasons the artificial intelligence 
community did not respond this way to Tale-Spin's limitations. Rather, the common 
diagnosis was that Tale-Spin didn't make enough plans. Specifically, Tale-Spin was 
critiqued for only accounting for the plans of characters, and not the plans of authors. 
The Minstrel and Universe systems were two that sought to address this perceived 
lack in different ways. 

Early publications about the Minstrel and Universe systems appeared in the mid- 
1980s, and bore a number of similarities. Not only did both emphasize the importance 
of authorial plans and goals (and cite Natalie Dehn's 1981 publications about the 
importance of such goals), they also focused on the importance of then-recent models 
of how people understand stories. In particular, they focused on Schank's “Memory 
Organization Packages” (also called “Memory Organization Points”) and Michael 
Dyer's “Thematic Abstraction Units” (also called “Thematic Affect Units”). No 
doubt this was in part due to the histories of the two systems” architects. The 
primary designer of Minstrel, Scott Turner, was at that time a student of Dyer's at 
UCLA, and Dyer had recently completed a dissertation at Yale influenced by Schank 
and Abelson's ideas.* The primary designer of Universe, Michael Lebowitz, had also 


recently written his dissertation at Yale under Schank's supervision — contributing 





¡Dyer's dissertation became the book In-Depth Understanding: A Computer Model of Integrated 
Processing for Narrative Comprehension (MIT Press, 1983). 
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to Schank's model of dynamic memory, especially in relation to story understanding? 

Working with this shared background, it is not surprising that, on one level, the 
Minstrel and Universe models of authorial activity are quite similar. Both see the 
system as seeking to provide for the audience those elements identified as necessary 
for story understanding. Of especial importance, for both Minstrel and Universe, 
is the idea of the “point” or “theme” that the system is attempting to put forth 
(Lebowitz, 1984, 175) (Turner and Dyer, 1985, 373) — something notably absent in 
most of Tale-Spin's stories about characters trying to satisfy simple needs. 

At the same time, the Minstrel and Universe systems are underpinned by rather 
different beliefs about how their processes relate to the behaviors of human authors. 
An examination of these differing beliefs is helpful in explaining the differing results 


achieved by the two efforts. 


Minstrel 


The Minstrel system was brought to completion over the course of a decade, resulting 
in Turner's 1994 publication of The Creative Process: A Computer Model of Story- 
telling and Creativity. Over the first few years of Minstrel”s development, some of the 
ideas at its foundation continued to evolve. Particularly, in Schank's lab the model 
of dynamic memory and its adaptations was extended into the idea of “Case-Based 
Reasoning” (CBR). The basic idea of CBR is in some ways quite close to that of 


scripts: in the main people do not decide what to do in each situation by reasoning 





2Lebowitz's dissertation was “Generalization and Memory in an Integrated Understanding Sys- 
tem,” filed December 1980. 
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from first principles, but rather by drawing on previous knowledge. However, rather 
than suggesting that each of us has a “restaurant script” and a “sports event script” 
and so on, case-based reasoning assumes that we remember many cases, and rea- 
son from them — much as the learning of previous cases is formalized in legal and 
business education. 

According to CBR theory, humans have three major types of cases we work with. 
There are “ossified cases” that have been abstracted to the point where they are 
essentially rules, such as proverbs. There are “paradigmatic cases,” each of which 
turns out to be the only experience we have that is relevant to a particular current 
situation, and which we adapt in order to understand the new situation. Finally, 
the most complex structures are “stories,” which Schank and Riesbeck characterize 
as “unique and full of detail, like paradigmatic cases, but with points, like proverbs” 
(1989, 13). The continuing reinterpretation of stories is described as the “basis of 
creativity in a cognitive system” (p. 14). 

Over the decade of work on Minstrel Turner also further developed his position 
on how story generation systems should be designed — and, in particular, what 
relationship such designs should have to the way that human authors go about their 
work. In Turner's conception, the relationship should be one of simulation. As he 


writes in The Creative Process: 


Authors craft stories to achieve a wide variety of complex and often com- 
peting goals. To understand why storytelling is so difficult, we must 
understand what an author is trying to achieve. To build a computer 
program to tell stories, we must understand and model the processes an 
author uses to achieve his goals. (p. 3) 
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In other words, much as Meehan sees the key to story construction in Tale-Spin as 
the accurate simulation of the goal-directed behavior of people as characters, Turner 
sees the key to story construction in Minstrel as the accurate simulation of the goal- 
directed behavior of people as authors. Given that Turner remained, at the time 
of this work, committed to the view of human behavior articulated by Schank and 
his colleagues, there was only one option for the design of Minstrel. As Schank and 
Riesbeck put it, “Case-based reasoning is the essence of how human reasoning works” 


(p. 7). Therefore, as Turner writes, “Minstrel is a type of case-based reasoner” (p. 11). 


Creating stories from stories 


Minstrel begins storytelling much as some human authors might: with a theme to 
be illustrated. The audience can request a particular theme, or Minstrel can be 
“reminded” of a story with a similar theme. However, Minstrel is not reminded 
by doing a search through the hard drive of the machine where it resides, or by 
monitoring the content of recent email delivered to the user of the machine, or by any 
other happenstance. Instead, Minstrel is reminded by being given a pool of fragments 
structured according to the internal schema representations it uses for characters and 
scenes. Matching fragments against stories in memory will result in one story being 
selected, and then Minstrel will have the goal of telling a story on the same theme. 
Minstrel uses case-based reasoning to meet its goals, including this one. But goals 
also need to be organized. And here we can see the first tension emerging between 
Turner's goals of simulating human behavior and creating a successful system for 


telling stories. Though there's no assertion made that human authors operate in 
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this way, Minstrel's goals are organized as an internal agenda. Planning proceeds 
in cycles, with each cycle attempting to accomplish the goal that currently has the 
highest priority on the agenda. If a goal fails it can be put back on the agenda at a 
lower priority, with the hope that later circumstances will make it possible to achieve. 


Turner describes the agenda this way: 


Minstrel begins storytelling with an initial goal to “tell a story.” This 
goal breaks down into subgoals including selecting a theme, illustrating 
a theme, applying drama goals, checking the story for consistency, and 
presenting the story to the reader. At each cycle, Minstrel selects the 
author-level goal with the highest priority from the goal agenda and passes 
it to the problem solving process. Problem solving finds a plan for that 
goal and executes it. 


Two important actions that Minstrel's plans can take are to create and 
add new scenes to the story, and to create and add new author-level goals 
to the planning agenda. As new scenes are created, they are added to 
the current story. As new goals are created they are added to the goal 
agenda. Storytelling finishes when the goal agenda is empty. (p. 77-78) 
Minstrel's themes are also represented in its schema system. Each theme is actu- 
ally a piece of advice about planning, and represents the kinds of characters, plans, 
and outcomes necessary for illustrating the theme. Though Minstrel is designed to tell 
stories in the domain of King Arthur's knights, its “planning advice themes” (PATs) 
are drawn from Romeo and Juliet, It's a Wonderful Life, and proverbs. For exam- 
ple, one of the PATs drawn from Romeo and Juliet is PAT: Hasty-Impulse-Regretted, 
based on Romeo killing himself upon discovering what he believes is Juliet's lifeless 
body — though, if he had waited a moment longer, she would have awakened from 
her simulated death. Oddly, Turner summarizes his schema representation of this as 


follows: 
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Decision: ¿¿Romeo believes something (4.Belief.1) that causes a goal fail- 
ure for him (£Goal.1). This and his hasty disposition motivate him to do 
something irreversible (*ZAct.1). 


Connection: £Romeo learns something new («%State.4) that supersedes 
the evidence for his earlier belief («2 Belief.1). 


Consequence: ¿Romeo now has a different belief, which motivates him to 
retract his earlier goal ((¿Goal.2) but he cannot, because his earlier action 
(£Act.1) is irreversible. (p. 104) 


Of course, as Turner notes, this is not actually what happens in Shakespeare's play. 
Romeo kills himself, and never knows that Juliet was not actually dead — much less 
regrets his decision. This is perhaps another artifact of the tension between building 
an operational system and simulating human authorship. In Minstrel, character-level 
goals and plans are represented in the schema, and so can be transformed (as outlined 
below). Author-level plans, on the other hand, are each structured, independent 
blocks of LISP code — presumably for reasons of authoring and execution efficiency.? 
Therefore, author-level plans are opaque to Minstrel's transformation procedures, 
which operate on the schema representations. As a result, if PATs are going to be 
transformed, which is Minstrel*s primary engine for producing new stories, then they 
must be represented at the character level, rather than at the authorial level. 

In any case, once a theme has been selected, this adds a set of goals to the agenda: 


instantiating examples of the decision, connection, consequence, and context of the 





3Turner writes: “Although the same type of representation could be used for Minstrel's author- 
level plans, 1t would be clumsy and time consuming. Schemas for complicated computational actions 
such as looping, recursion, and so on would have to be defined and an interpreter built to perform 
these actions. Fortunately, Minstrel is built upon a representation for computation — Lisp. Rather 
than reinvent the wheel, Minstrel uses Lisp to represent its author-level plans and uses the Lisp 
interpreter to execute those plans” (p. 81). 
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PAT. Once transformation plans succeed in creating the sequence of story events that 
accomplish these goals, other goals can come into play. One set of secondary goals 
are called “drama goals” and include suspense, tragedy, foreshadowing, and charac- 
terization. A characterization goal, for example, would add a story scene illustrating 
that a character has an important personality element (e.g., makes decisions in haste) 
before the section of the story that illustrates the PAT. Another set of goals, “con- 
sistency goals,” fill out elements that aren't the bare-bones illustrations of the PAT. 
For example, if a knight kills another person, consistency goals makes sure that he 
is near the person first, and makes sure that he has an emotional reaction afterward. 
Finally, presentation goals make the final selection of the scenes that will be in the 


story, their ordering, and how they will be expressed in English. 


Minstrel's TRAMs 


At the heart of Minstrel's approach to generating stories is Turner's take on creativity, 
one which (like the structures of PAT's) is based on case-based reasoning: TRAMs. 
These “Transform-Recall-Adapt Methods” are a way of finding cases in the system 
memory that are related to the current situation and adapting elements of these 
previous cases for new uses. In this way stories can be generated that illustrate a 
particular theme without reusing previous stories verbatim. 

One example that Turner provides shows Minstrel trying to instantiate a scene of 
a knight committing suicide (though it is unclear which PAT this will help illustrate). 
Minstrel's first TRAM is always TRAM:Standard-Problem-Solving, which attempts 


to use a solution that already exists in memory. This TRAM can fail in two ways. 
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First, it is possible that there is no case in memory that matches. Second, it is possible 
that the matching cases in memory have already been used twice, which results in 
them being assessed as “boring” by the system — so a new solution must be found. 
For either type of failure, the next step is to transform the problem and look for a 
case matching the transformed problem. 

In Turner's example, Minstrel”s memory only contains the schemas for two episodes. 
In the first a knight fights a troll with his sword, killing the troll and being injured in 
the process. In the second a princess drinks a potion and makes herself ill. Neither 
of these is a direct match for suicide, so Minstrel must transform the problem. 

One possible transformation is TRAM:Generalize-Constraint. This can be used 
to relax one of the constraints in a schema. In this case, it is used to relax the 
requirement of a knight killing himself. This is the “Transform” step in a TRAM, 
and it is followed by the “Recall” step. Here the system searches for a scene of a 
knight killing anything — not just himself — and succeeds in finding the scene of 
the knight killing a troll. Since this was successful, the next step is to attempt to 
“Adapt” this solution to the new situation, by reinstating the constraint that was 
relaxed. The result is then assessed, and deemed appropriate, so Minstrel determines 
that the knight can kill himself with his sword. 

But this is only the most simple use of Minstrel's TRAMs, and the system finds 
other methods of suicide by a more complex route. For example, there is also 
TRAM:Similar-Outcomes-Partial-Change. According to Turner, this TRAM “rec- 
ognizes that being killed is similar to being injured” (p. 49) and transforms the 


schema to one in which a knight purposefully injures himself. This, however, returns 
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no matching cases. The knight fighting the troll is not retrieved, because the injury 
was accidental. The princess drinking the potion was not retrieved, because the actor 
was not a knight. But this does not cause Minstrel to simply give up on the direction 
proposed by TRAM:Similar-Outcomes-Partial-Change. Instead the TRAM process 
begins again, recursively, using the already transformed problem and applying a dif- 
ferent TRAM to it. In this next stage, by applying TRAM:Generalize-Constraint to 
the actor, it is able to find the princess drinking a potion to injure herself. It adapts 
by reapplying the generalized constraint to create a schema for a knight drinking a 
potion to injure himself, and then returns to the original TRAM. This adapts by 
changing from injuring to killing, and the result is an event of a knight drinking a 
potion to kill himself. This is assessed as successful, added to the story, and added 
to memory so that it can become a case retrieved by other TRAM processes. 

And that's not all — the TRAM:Similar-Outcomes-Partial-Change also helps gen- 
erate another plan for suicide when used as a second-level TRAM. In this case the 
first-level transformation is TRAM:Intention-Switch, which changes the schema from 
a knight purposefully killing himself to accidentally killing himself. When this, at the 
next level, is transformed from death to injury, the fight with the troll is found in 
memory. Minstrel then produces a story of a knight going into battle in order to die. 
With three different suicide methods found for the knight, Turner's example comes 


to an end as well. 
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Minstrel's stories 


Through various series of small, recursive transformations such as those outlined 
above, Minstrel is able to produce story events significantly different from any in its 
memory. While it can only elaborate as many themes as it has hand-coded PATs, 
with a large enough schema library it could presumably fill out the structures of those 
themes with a wide variety of events, creating many different stories. But enabling a 
wide variety of storytelling is not actually Turner's goal. He writes: “Minstrel begins 
with a small amount of knowledge about the King Arthur domain, as if it had read 
one or two short stories about King Arthur. Using this knowledge, Minstrel is able 
to tell more than ten complete stories, and many more incomplete stories and story 
fragments” (p. 8-9). We are told that accomplishing this requires about 17,000 lines 
of code for Minstrel, and another 10,000 lines of code for the tools package upon 
which it is built. 

With such elaborate processes, requiring so much time to develop and so many 
lines of code to implement, why starve Minstrel for data — only giving it the schemas 
equivalent to one or two short stories? Certainly no human storyteller was ever so 
starved for data. We all hear and read many, many stories before we begin to tell 
successful stories ourselves. Certainly the reason is not to achieve greater connection 
with Minstrel's underlying theories from cognitive science. In Schank's CBR theories 
an expert — such as an expert storyteller — is someone with access to a large body 
of cases that are effectively indexed and retrieved. 

One possible explanation is that starting with a small body of cases shows off 


Minstrel's creativity to greater effect. It ensures that TRAM:Standard-Problem- 
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Solving will be nearly useless when the program begins, so recursively-built solutions 
will be needed almost immediately. The number of stories the system is able to create 
(about ten) is also clearly much larger than the number it begins with (about two). 

But it is more likely that the complex, and in some ways fascinating model of 
Minstrel was also exceedingly brittle. It may have produced more and more mis- 
spun tales as more data was added to the system, due to the unpredictable emergent 
behavior encouraged by the TRAM system. Turner gives some indication of this 
when he reports on his attempt to add a new theme after the system was complete. 


Unfortunately, the story produced by PAT:PRIDE is seriously flawed: 


Once upon a time, a hermit named Bebe told a knight named Grunfeld 
that if Grunfeld fought a dragon then something bad would happen. 


Grunfeld was very proud. Because he was very proud, he wanted to 
impress the king. Grunfeld moved to a dragon. Grunfeld fought a dragon. 
The dragon was destroyed, but Grunfeld was wounded. Grunfeld was 
wounded because he fought a knight. Grunfeld being wounded impressed. 
the king. (p. 240, original emphasis) 

The problem arises because of the actions of a transformation called TRAM:Similar- 
Thwart-State, and Turner was able to revise this TRAM to remove portions of 
episodes that it was not able to adapt. But it is important to remember that this 
problem arose with the completed system (and not an incomplete one, as with Tale- 
Spin's mis-spun tales). Another error occurs when a knight kills and eats a princess, 
adapting a plan from a dragon (p. 278). Of course, a problem such as this could also 


be addressed with further changes to the system. But it seems likely that, as any 


further data was added to the system, more emergent behavior problems would keep 
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cropping up. Rafael Pérez y Pérez and Mike Sharples suggest something along these 


lines in their evaluation of Minstrel, writing: 


[T]he reader can imagine a Knight who is sewing his socks and pricked 
himself by accident; in this case, because the action of sewing produced 
an injury to the Knight, Minstrel would treat sewing as a method to kill 
someone. (2004, 21) 

But Pérez y Pérez and Sharples do not go nearly far enough with this statement. 
If Minstrel's only problems were of this sort — knights eating princesses, killing 
via sewing — then all that it would take to fix Minstrel would be a large body of 
encoded knowledge about the King Arthur domain. This knowledge would be used 
to reject such events at the TRAM's assessment stage. Such knowledge wouldn't be 
easy to produce, though perhaps much of it could be adapted from — or built atop 
— a project such as the Cyc Ontology (which will be discussed further in the next 
chapter). 

Building up this knowledge, however, would only serve to limit Minstrel?s errors 
to the much more profound type illustrated by the PAT:PRIDE story quoted above. 
Here Minstrel's adaptations result in completely losing the story thread — suddenly, 
Grunfeld, who was fighting a dragon, has been wounded because he was fighting 
a knight (and being wounded in this way apparently impresses the king). Turner 
was unable to fix this problem in any deep way, and instead resorted to having 
TRAM:Similar-Thwart-State simply cut out the parts of episodes that it was failing 
to adapt. Presumably, for the example above, this would involve cutting out the 
story?s entire conclusion. 


Here we can see, in Minstrel, symptoms of a much larger problem. One which 
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Turner, alone, could have done little to address. By the late 1980s it was clear that 
Al systems in general were not living up to the expectations that had been created 
over the three previous decades. Many successful systems had been built — by both 
“neats” and “scruffies” — but all of these worked on very small sets of data. Based 
on these successes, significant funding had been dedicated to attempting to scale 
up to larger, more real-world amounts of data. But these attempts failed, perhaps 
most spectacularly in the once high-flying area of “expert systems.” The methods 
of Al had produced, rather than operational simulations of intelligence, a panoply 
of idiosyncratic encodings of researchers” beliefs about humanity. Guy Steele and 
Richard Gabriel, in their history of the Lisp programming language (1993, 30), note 
that by 1988 the term “Al winter” had been introduced to describe the growing 
backlash and resulting loss of funding for many Al projects. 

While Minstrel looked, from inside Al, like a significant improvement over Tale- 
Spin — with an improved model of human cognition, and a smarter choice of which 
humans to simulate — from our current perspective the conclusion is, to put it 
charitably, debatable. Instead Minstrel looks like an inventively constructed Rube 
Goldberg device, massive and complex, continually reconfiguring a few small pieces at 
its center, and likely to create wildly unintended configurations if given a piece of even 
slightly different shape. It attempts to create more complete and structured fictional 
worlds than Tale-Spin, by bringing an author into the processes, but gives that author 
so little data to work with that its alternate universes are mostly uninhabited. The 
end result of trying to simulate a human author, even with compromises toward 


greater system functionality, is 27,000 lines of code that produce roughly 10 stories 
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of “one-half to one page in length” (p. 8). 


Universe 


As mentioned early in this chapter, the primary designer of Universe is Michael 
Lebowitz.* The Universe system shares a certain intellectual heritage with Minstrel 
and Tale-Spin, and it also has another unusual shared feature in common with Tale- 
Spin. As we saw with Tale-Spin, the most famous story attributed to Universe has a 
somewhat more tenuous connection to the project's output than one might assume. 


Here is the story: 


Liz was married to Tony. Neither loved the other, and, indeed, Liz was 
in love with Neil. However, unknown to either Tony or Neil, Stephano, 
Tony”s father, who wanted Liz to produce a grandson for him, threatened 
Liz that if she left Tony, he would kill Neil. Liz told Neil that she did not 
love him, that she was still in love with Tony, and that he should forget 
about her. Eventually, Neil was convinced and he married Marie. Later, 
when Liz was finally free from Tony (because Stephano had died), Neil 
was not free to marry her and their trouble went on. 


Though a number of prominent authors? provide this as an example of Universe's 
output, in fact this is a summarization of a plot from Days of Our Lives. It appears 
in a paper about Universe as “an illustration of the kind of plot outlines we would 


like to generate” (Lebowitz, 1985, 172). Unfortunately, Universe has never been able 





“Lebowitz's work on Universe was carried out while a faculty member at Columbia University, 
during which time (according to the acknowledgments in Lebowitz's 1984 and 1987 papers) work by 
Paula Langer and Doron Shalmon made significant contributions to the project and Susan Rachel 
Burstein helped develop many of the ideas. 


5Including Marie-Laure Ryan (1992, 246) and Janet Murray (1997, 201). 
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to display the level of mastery that the authors of Days of Our Lives — a remarkably 
popular and long-running daytime television melodrama, or “soap opera” — have 
achieved. However, this story does nonetheless point to a number of important ways 
in which the goals of Universe are significantly different from those of systems such 
as Tale-Spin and Minstrel. 

First, Universe is designed to to generate continuing serials: stories that never 
end. Second, Universe is working in one of the world's currently most popular story 
forms (television melodrama) rather than the somewhat archaic (and more difficult for 
contemporary audiences to judge) forms of Aesop-style fables and tales of Arthur's 
knights. Third, Universe's goals are defined in terms of what kinds of story and 
character structures it will generate, rather than in terms of the model of human 
cognition that the system's operations will simulate. 

The last of these is, perhaps, the most significant. While ideas such as “memory 
organization points” are important in the conception of Universe, the system is not 
presented as a simulation of a model of human cognition. Rather, it is presented as a 
means of generating a universe of characters and an ongoing plot that interconnects 
them. In fact, in lieu of any cognitive science theory, Lebowitz writes: “Our methods 
are based on analysis of a television melodrama” (Lebowitz, 1985, 483). This allows 
Universe to be designed specifically for the generation of stories, and of a particular 
style of stories, rather than for the simulation of the behavior believed to generate 
stories. 

As with Minstrel and Tale-Spin, story generation in Universe is organized via 


plans. In fact, Lebowitz goes so far as to claim, “Story generation can best be viewed 
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as a planning task” (1987, 234). But the goal of Universe's stories is to create complex 
interweavings of events between a core group of interrelated characters. Given this, 


before planning begins, Universe must create a cast of characters. 


Universe's characters 


A Universe story begins with the creation of characters, much as happens in Tale- 
Spin. But rather than a small number of characters who seem to come into existence 
at the moment the story begins, Universe creates an interconnected group of char- 
acters with somewhat detailed histories. This is accomplished through a character 
creation cycle. As outlined in Lebowitz's 1984 paper, the cycle begins with a few 
characters in a queue, soon to become the universe's (m/p)atriarchs, who need the 
details of their lives fleshed out. One character at a time is removed from the queue, 
and a simple simulation of that character's life is carried out — focusing on the gain- 
ing and losing of spouses, the birth of children, and the possibility of death — until 
the present is reached. Any new characters created through this process are added 
to the queue. New characters aren't created for each marriage, however, because the 
system may select an already existing eligible character (defined as single at the time 
of the marriage, of appropriate age and sex, and not directly related to the character). 
This begins to create interconnections between the families. 

Once this basic framework of marriage, birth, and death is filled in, each character 
is further fleshed out. This begins by giving each character a set of traits, some of 
which are inherited from their parents (if known), and selecting a set of stereotypes 


that work well to explain those traits. Stereotypes include many familiar character 
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elements (e.g., lawyer, doctor, gangster, big-eater, swinger, video-game-player) and 
have normal values for some traits and not others (e.g., “lawyer” has normal val- 
ues for intelligence, guile, and self-confidence, but not for religion, promiscuity, or 
moodiness). Following this, the system adds further detail to the characters” pasts 
by creating simplified versions of the sorts of events it will create in the present (once 


story generation begins). 


Universe's stories 


The model of planning in Universe is somewhat different than in the systems we have 
examined thus far. Because Universe is not aimed at producing stories that end, but 
rather serial melodramas on the model of Days of Our Lives, its plans are never aimed 
at bringing things to completion. In Tale-Spin, and most Al work on planning, the 
focus is on achieving goals: Joe Bear is hungry, and the planning process tries to get 
some food and ingest it so that his hunger will go away. In Minstrel the plans are to 
flesh out a PAT schema, meet the other goals, and complete the story. Universe, on 
the other hand, plans based on “character goals” and “author goals” that do not lead 
toward conclusions. Character goals are monitored to maintain consistency, while 
the primary impetus for story generation comes through author goals. The author 
has goals for maintaining an interesting story — Lebowitz talks about goals such as 
preserving romantic tension and keeping the story moving — with the result that 
Universe's plans can never specify a complete course of action, only one that seems 
appropriate given the current circumstances in the story?s universe. 


High-level author goals are carried out by lower-level goals, and planning for both 
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takes place through “plot fragments.” A higher-level goal to which Lebowitz gives 
particular attention is “churning” lovers, keeping them separated by new obstacles 
each time the previous set is cleared up. The forced marriage of Liz and Tony, on 
Days of Our Lives, is by Lebowitz regarded as a fragment that achieves (among 
other possible goals) the “churning” of Liz and Neil. This makes it apparent how 
character goals are treated quite differently in Universe as opposed to systems such 


as Tale-Spin. As Lebowitz writes about “churning”: 


Obviously this goal makes no sense from the point of view of the characters 
involved, but it makes a great deal of sense for the author, and, indeed, 
is a staple of melodrama (“happily ever after” being notoriously boring 
in fiction, if not in life). Universe has a number of other plot fragments 
[besides forced marriage] for achieving this goal, such as lovers” fights and 
job problems. (Lebowitz, 1985, 488) 


Universe maintains a representation of outstanding author and character goals. 
The storytelling cycle begins with choosing an author goal that has no unmet precon- 
ditions. A plot fragment is selected that will achieve that goal, with preference given 
to fragments that also achieve other goals that are current. This is plot fragment is 
then made part of the story — producing new characters, events for output, and new 
goals as appropriate. Even “forced marriage” is a relatively high-level plot fragment, 
which needs to be filled out with lower-level fragments for the woman dumping her 
husband, the husband getting together with another woman, the threat from the 
parent being eventually eliminated, and so on. The potential choice of a number of 
different fragments and characters for each of these elements increases the variability 


of the story structures Universe produces. 
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As this process takes place, Universe doesn't simply choose characters and plot 
fragments randomly. First, the personalities and plans of characters constrain which 
can play roles in the fragments (and, further, some fragments require the partici- 
pation of characters that have particular stereotypes). Second, with each fragment 
Universe tries to select events and include characters that will help meet other active 
authorial goals. This helps create complexly interwoven plots, such as those of serial 
melodramas, in which events often contribute to several active storylines. 

Table 6.1 is an example of an actual Universe output for a forced marriage sto- 
ryline, using the same characters as the Days of Our Lives plot summary above 
(Lebowitz, 1985, 491). Those lines of the output that begin “>>>” represent low- 
level events, whereas other text provides a trace of the planning process. The system 
begins with two active goals: to churn Liz and Neil, and to get Neil together with 
Renee. 

Other plot fragments that Universe can use for churning include LOVERS-FIGHT, 
JOB-PROBLEM, PREGNANT-AFFAIR, ACCIDENT-BREAKUP, STEAL-CHILD, COLLEAGUE- 
AFFAIR, and AVALANCHE-ACCIDENT. The variations on these depend on the charac- 
ters involved. For example, in Lebowitz's 1987 paper he shows output from churning 
Joshua and Fran. Given their jobs, they can experience the job problems of BUREAU- 
CRAT and SLEAZY-LAWYER. Given other aspects of their characters, they can fight 
about IN-LAWS, MONEY, SECRETS, FLIRTING, and KIDS. 

While Universe (at the time of Lebowitz's 1987 paper) only contained 65 plot 
fragments, it was already enough to generate stories with more variety than Minstrel 


and more structure than Tale-Spin in their completed states. Further, its fictional 
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*(tell *(((churn liz neil) (together renee neil)))) 


working on goal — (CHURN LIZ NEIL) 
Several plans to choose from FORCED-MARRIAGE LOVERS-FIGHT JOB-PROBLEM 
— using plan FORCED-MARRIAGE 


working on goal — (DO-THREATEN STEPHANO LIZ “forget it”) 
— using plan THREATEN 


>>> STEPHANO threatens LIZ: “forget it” 


working on goal — (WORRY-ABOUT NEIL) — using plan BE-CONCERNED 
Possible candidates — MARLENA JULIE DOUG ROMAN DON CHRIS KAYLA 
Using MARLENA for WORRIER 


>>> MARLENA is worried about NEIL 


working on goal — (TOGETHER * NEIL) 

Several plans to choose from SEDUCTION DRUNKEN-SNEAK-IN 
SYMPATHETIC-UNION JOB-TOGETHER 

Possible candidates — DAPHNE RENEE 

Using RENEE for SEDUCER 


>>> RENEE seduces NEIL 

working on goal — (ELIMINATE STEPHANO) 

Several plans to choose from ATTEMPTED-MURDER EXPOSE 

— using plan ATTEMPTED-MURDER 

Using ALEX for KILLER 

>>> ALEX tries to kill STEPHANO 

working on goal — (DO-DIVORCE TONY LIZ) — using plan DIVORCE 


>>> LIZ and TONY got divorced 


working on goal — (TOGETHER LIZ NEIL) 
no acceptable plans 


Table 6.1: Universe Output 
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worlds were much more complex and complete than those of either system, despite 
the fact that Minstrel was not finished until significantly after this point in the devel- 
opment of Universe. In short, eschewing the simulation of human cognitive processes 
was a demonstrably powerful outlook, from the perspective of fiction — but where 


did it leave the Al researcher? 


What kind of author? 


Though it is clear that Universe is not designed to simulate a human author through 
its internal processes, at the same time the main processes it uses for storytelling are 
referred to as “author goals.” This may lead one to wonder, “Which author is that?” 

Also, while no cognitive science model of creativity is given as the basis of the 
system's design, Lebowitz still gestures toward the cognitivist view of Al, writing in 
one paper that a goal of Universe is “to better understand the cognitive processes 
human authors use in generating stories” (1984, 172) and in another that we “can 
expect research into extended story generation to ... give us insight into the creative 


mechanism” (1985, 484). Exactly how this will take place is never explained. 





S However, in one publication about the system (1985) Lebowitz writes: 


This is precisely the way that some authors work in developing novels: create a set of 
characters and work from there. For example, Eco says that “What I mean is that to 
tell a story you must first of all construct a world, down to the slightest details.” He 
goes on to describe how this “world” must include “lists of names and personal data 
for many characters.” (p. 486) 


Lebowitz is quoting Umberto Eco's postscript to The Name of the Rose. And, unfortunately, 
even this half-hearted attempt is troubled. While Eco's postscript does say that he eventually found 
a 1975 notebook in which he had written down a list of monks, the postscript's primary account of 
the genesis of the book reads: “I began writing in March of 1978, prodded by a seminal idea: I felt 
like poisoning a monk” (Eco, 1994, 509). In short, if Eco's process for The Name of the Rose bears 
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It is understandable that, less than a decade out of Schank's lab, Lebowitz was 
unable to entirely drop cognitivist language in discussing his Al projects. In fact, to 
some extent, publication in Al journals and at Al conferences may have demanded 
it. In Lebowitz's last paper on Universe, at the 1987 conference of the Cognitive 
Science Society, he even suggests that case-based reasoning may be a future direction 
for Universe's plot generation (p. 240). 

But the next year Lebowitz left Columbia's faculty, and by the publication of 
Ray Kurzweil's book The Age of Intelligent Machines (1990) Lebowitz was riding 
out the Al winter in a rather different context. In the book, accompanying a capsule 
description of Universe, he is listed as a vice president of the Analytical Proprietary 
Trading unit at Morgan Stanley and Company (p. 390). 

And it was only a few years later, in the early 1990s, that a set of Al techniques 
with no pretense to modeling human intelligence would rocket into the public con- 


sciousness: statistical techniques. 


Statistical Techniques 


In the early 1990s the World Wide Web was largely human-organized. The “What's 
New” page at NCSA provided regular links to new web resources (much like one of 
today's blogs) and Yahoo — still on servers at Stanford University — maintained a 


growing hierarchical directory, to which pages were added by hand. Many individuals 





a resemblance to the way that Universe stories begin, it is probably at the level of plot fragments 
(poisoning a monk is a fragment) rather than in developing the world before telling the story — and 
in either case it is quite weak. 
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maintained hotlists, or pages of links, that were the definitive sources for finding 
the web's best information on certain topics. Much as pre-Dewey librarians had 
done before them, and Al researchers more recently, these editors of web resources 
organized knowledge according to their own ideas (often no more than intuitions) of 
what belonged with what. Web page “meta tags” helped authors express what they 
felt were the relevant keywords for their contributions, and page titles were a very 
important factor in determining which results came back from a search. Generally, 
finding the resources with “cool shades” next to them in the Yahoo directory was a 
better route to information than using the clumsy search engines. 

But this began to change, and quickly, as search engines improved. This improve- 
ment came not by engines coming to better “understand” the pages they indexed, 
not by better interpreting document contents as traditional Al researchers had at- 
tempted, but by counting. For example, the “PageRank” algorithm that rocketed 
Google to success operates by counting the links that interconnect web pages. As the 


Google website explains: 


PageRank relies on the uniquely democratic nature of the web by using 
its vast link structure as an indicator of an individual page's value. In 
essence, Google interprets a link from page A to page B as a vote, by 
page A, for page B. But, Google looks at more than the sheer volume of 
votes, or links a page receives; it also analyzes the page that casts the 
vote. Votes cast by pages that are themselves “important” weigh more 
heavily and help to make other pages “important.” 


http://www.google.com /technology / 


Of course, describing this process as “democratic” is a rhetorical choice on the 


part of Google's marketing department. When the web's most “important” pages are 
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almost all owned by media and technology conglomerates, the power to bless view- 
points on widely-discussed topics (e.g., national politics) with attention via PageRank 
lies in much the same hands as the power to grant attention does in non-web media. 
If democracy was the goal, one can imagining weighting Google's statistical model 
somewhat diffterently. Nevertheless, even if it is possible to debate whether PageR- 
ank is democratic, there is no debating that it is statistical. It is based on counting 
information about pages (links in and out) and interpreting the resulting numbers. 
We now take for granted the ability of a computer program to find useful documents 
on a subject that interests us — though it would have seemed an amazing feat of Al 
not that many years ago. 

The success of another high-profile web business is also partially tied to a statis- 
tical Al technique. In this case, Amazon's recommendation system. Amazon doesn't 
decide which books and movies to recommend to its customers by deeply analyzing 
the content of each item it sells and “noticing” similarities between them. Nor does 
it simply follow rules that dictate recommending, for example, other books by the 
same author. Instead, it looks at the numbers: the numbers of people who bought 
an item and also bought another item, or who looked at a product page and also 
looked at another product page (Linden, Smith, and York, 2003). It is these types 
of statistical correlations that let Amazon know to (as of this writing) recommend 
the television show Veronica Mars to those who visit the page for Wonderfalls — 
even though there is no overlap in the shows” creators or stars, one is set in upstate 
New York and the other in Southern California, and the main character of one is a 


college-educated woman who works in a gift shop while the other is a mystery-solving 
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high school student. Companies like Amazon build such statistical models not only in 
an attempt to improve sales of these items, but also because this type of information 
about products and their purchasers is a saleable commodity in itself. 

Looking beyond applications such as Amazon's, statistical Al techniques can also 
engage with human expressions, not just recognize what correlates with them. Sta- 
tistical techniques have improved computer speech recognition to the point where 
automatic dictation (and automated phone systems, with their frustrations) are a 
practical reality. And statistical techniques don't just help computers recognize what 
words people speak, they can also analyze patterns of words. The most successful 
email spam filters, for example, are built using statistical AI techniques. 

And, of course, statistical techniques can also be used to interpret and generate 
individual pieces of text. An earlier chapter looked in depth at one concept useful in 
a number of statistical approaches to language: n-grams. No one has yet created a 
major natural language or story generation project that uses statistical Al techniques 
as a primary process, but statistical approaches have now reached such a level of 
importance that no one working with human language can fail to address them, and 


no one working in Al has failed to notice the shift in gravity. 


Natural language processing revisited 


As discussed in the previous chapter, the branch of computer science focused on hu- 
man language is Natural Language Processing (NLP). On some level, human language 
is a perfect fit for the digital computer. As may be recalled from an earlier chapter, 


digital computers operate on discrete elements — like numbers — rather than the 
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continuous representations of analog computation. This makes digital computers well 
suited to computing text, given that text, too, is a system of encoding that uses dis- 
crete elements — like letters. Further, language has grammatical structures (obeyed 
by most texts) and these, too, are amenable to our current models of computation. 

On the other hand, human language is a messy and complicated affair. Words 
do not always mean the same thing — the same sequence of letters doesn't encode 
the same meaning — from sentence to sentence, place to place, or time to time. It 
can be hard to even tell what is a word (though making best guesses is necessary 
for the many algorithms that employ word “tokenization”) and, at times, groups of 
two or more words may actually be functioning as a single word (what linguists call 
“collocations”). And, of course, much human language behavior is in the form of 
speech (rather than text) and here we enter the realm of continuous (rather than 
discrete) encodings, a much more difficult problem. 

We might characterize the two main approaches to dealing with this dual na- 
ture of language computing as “structure first” and “mess first.” The structure-first 
approach, discussed in the previous chapter, largely follows the sort of rationalist 
approach to language advocated by Noam Chomsky. In Chomskyian linguistics, 
coming to understand the general structure of linguistic behavior — cutting across 
time, place, and specific languages — helps linguists grapple with the specifics of 
understanding and producing particular linguistic acts. This is where the first major 
work in natural language processing, such as that in Schank's lab, took place. 

The mess-first approach, on the other hand, follows what has been called an 


empiricist approach to language. It tries to, rather than understand by fitting partic- 
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ular linguistic acts to already-understood structures, draw meaningful patterns out 
of examinations of linguistic data. These linguists produce and analyze (often large) 
bodies of linguistic data in order to enable such work, and one of these bodies is 
usually called a corpus. Such techniques were basically impractical until the 1970s, 
when appropriately powerful computing resources began to be affordable for this sort 
of research. Just as happened with the web, the 1990s saw an explosion in the use of 
these “statistical” techniques for language as it was demonstrated that they were able 
to achieve more accurate results, in many contexts, than the traditional techniques 
based on rules for grammatical and/or semantic analysis. Of course, as it turns out, 
the work of drawing meaningful patterns out of masses of textual data (and then 
employing them) is often best accomplished by those who use some knowledge of 
grammatical structure in the design of their processes — with the result that much 
work in statistical NLP contains elements of the more traditional work. 

Two major areas of activity in NLP are Natural Language Understanding (NLU) 
and Natural Language Generation (NLG). NLU is focused on the interpretation of 
human language, while NLG is focused on the production of language for human 
audiences. Some digital fictions, such as interactive characters and fictions to which 
the audience responds textually, employ NLU. In this study, however, our primary 


focus will be on NLG. 


Statistical techniques for language 


As the authors of the textbook Foundations of Statistical Natural Language Processing 


(Manning and Schitze, 1999) explain, “statistical” NLP techniques are not exactly 
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those that might be assumed by those who have taken an introductory statistics 


course. Rather: 


Statistical NLP as we define it comprises all quantitative approaches to 
automated language processing, including probabilistic modeling, infor- 
mation theory, and linear algebra. While probability theory is the founda- 
tion for formal statistical reasoning, we take the basic meaning of the term 
“statistics? as being broader, encompassing all quantitative approaches to 
data (a definition which one can quickly confirm in almost any dictionary). 
(p. xxxi) 


The idea of quantitative approaches to linguistic data has an uncomfortable sound 
for many writers. And, in fact, the simplest technique of statistical NLP — counting 
the frequencies of words in a corpus — has been explicitly derided as a starting point 
for literary interpretation by Oulipian writer Italo Calvino.” And yet, by engaging 
deeply with the specifics of language, statistical techniques may actually open more 
“writerly” alternatives to the mechanistic formulations of many structure-oriented 
NLG systems. Of course, to make such use writers may need to employ the tools of 
statistical NLG in ways that depart from those common in science and industry. 

Recall, for example, the “n-grams” or “Markov chains” demonstrated by Shannon. 
These have been of use to those creating practical NLU and NLG tools, and — as 
we saw two chapters ago — also useful to those engaged in literary experimentation. 
In both cases, in keeping with the basic quantitative approach of statistical NLP, 
a corpus of language is examined (either in its entirety, or through partial random 


survey) and a model of the chains of words that run through it is developed. For 





“In his novel If on a winter's night a traveler (1981). 
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example, the software might gather data on all the chains of two or three words that 
exist in a corpus, as well as the number of times that each appears.? 

There are a number of potential situations in which such a model might be of 
practical use. For example, an NLU system may have trouble interpreting a particular 
word, or an NLG system may have trouble choosing which of several possible words 
to use at a particular point in a sentence. If a corpus of similar language has been 
examined, and a model of its word chains developed, this may help. Given that 
NLP researchers and system developers commonly work with corpora of millions of 
words, chances are good that a model will reveal that some words appear much more 
frequently after the sequence of words understood /generated just before that point. 
Of the words being considered by the NLU/NLG system, if one of them is significantly 
more common it is probably the one to choose. 

On the other hand, always choosing the most common word sounds like a recipe 
for mediocre language expressing mainstream sentiments — which would be deadly 
in most creative contexts. Except that, remember, we are speaking of the most 
common word in the examined body of linguistic data. That data could be carefully 
shaped, so that a process operating on word chains would find a preponderance of 
connections considered interesting by an author — rather than those that are most 
everyday. This type of shaping of data would (while, like the constrained writing 
which makes Queneau's Poems work, a data-focused technique) be quite different 


from traditional literary authorship. For one thing, writers are more accustomed to 





80Or, more likely when working with large amounts of language, the chains for all words except 
those that appear only once — which will reduce the size of the data without doing much to alter 
the quality of the model. 
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working with thousands of words, rather than millions — but having a large body of 
text would be desirable, as a smaller corpus would limit the techniques that can be 
used. Also, statistical techniques often find unexpected patterns, invisible to normal 
practices of reading, within bodies of text. For an author this can be both delightful 
and frustrating. 

A related problem with purely statistical techniques is that, in their focus on the 
specifics of language, they also become difficult to use when one seeks to operate at 
higher, structural levels. Some of this was seen two chapters ago, in the work using 
n-gram techniques for textual generation, much the way Shannon used them. Here 
we will consider an approach to statistical techniques that aims to apply them in 
contexts much more structured than Shannon's game — which would be necessary, 
for example, if statistical techniques were to be used in generating a story system's 


output. 


Selection through statistics 


It is true that standard NLG techniques are used with most story generation systems, 
but the domain of poetry can't help but accent the shortcomings of traditional ap- 
proaches. Poetry foregrounds the heretical possibility that language must come first 
in some generation processes, or at least that there must be a back and forth between 
the “realization” of specific text and the processes shaping the overall structure. This 
leads those with an interest in poetry (and in fiction for which the specifics of lan- 
guage are important) to consider critiques of the linear “pipeline” architecture for 


NLG systems outlined in the previous chapter. 
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As it turns out, the standard pipeline architecture presents problems beyond the 
realm of poetry. Its limitations are apparent in many applications that include spe- 
cific goals for the resulting surface text, ranging from controlling idiomatic construc- 
tions to aiming toward a particular document length. In Hisar Maruli Manurung's 
dissertation, An Evolutionary Algorithm Approach to Poetry Generation (2003), he 
summarizes the pipeline approach's limitations by pointing out that the elements 
of text generation are not actually independent, and when moving through a linear 


process ... 


making choices in one aspect can preclude the possibility of choices in 
other aspects. When these decisions are made by the separate modules in 
the pipeline architecture ... the resulting texts may be suboptimal, and 
in the worst case, the system may fail to generate a text at all. This 
problem has been identified in Meteer (1991) as the “generation gap”, in 
Kantrowitz and Bates (1992) as “talking oneself into a corner”, and also 
in Eddy (2002), who notes that it is not easy to determine what effect a 
decision taken at an early stage in the pipeline will have at the surface, 
and decisions taken at one stage may preclude at a later stage a choice 
which results in a more desirable surface form. (p. 36) 


A number of solutions have been pursued by the NLG community, even as the 
mainstream of research has remained in the pipeline model. Alternative formulations 
include changes to the standard pipeline arrangement (including the “revision,” “feed- 
back,” and “blackboard” architectures) as well as the more radical step of an “inte- 
egrated” architecture that dispenses with many of the boundaries between components. 
Within the group pursuing integrated architectures, Manurung points particularly to 
those who formulate NLG as a constraint satisfaction problem — searching through 


the “space” of possible solutions by making alterations to an initial (presumably not 
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optimal) generation, seeking a text that meets as many of the system's semantic and 
pragmatic goals as possible. A particular kind of search that incorporates random 
elements, known as an “evolutionary algorithm,” is the approach Manurung takes. 
At least as interesting to us, however, is the set of approaches known as “overgen- 
eration and ranking” methods. With these methods, rather than using traditional or 
alternative techniques to generate one “best” text, many possibilities are generated. 
Then the final text is selected by using statistical techniques to evaluate the different 


possibilities. 


Language models 


The evaluation method used by most of these systems employs what is known as a 
“language model.” As with most statistical models, the primary use of a language 
model is to predict the future (linguistic) behavior of a system based on a sample of 
its past (linguistic) behavior. Constructing such a model requires two efforts. The 
first is deciding what to pay attention to: what “features” of the past behavior will 
be treated statistically. The second is deciding how to make those features part of an 
operating statistical model. 

The basic n-gram approach of Shannon's process can serve as a basis for a language 
model. In fact, as mentioned earlier, it is a quite popular one. In this case, the features 
are simply the word or two that appear before the current one. Then a simple model 
would be: the different following words are assigned likelihoods based on how often 
they appear after the preceding ones within the sample data. In actuality, the models 


tend to be somewhat more complicated. This is in part because, even after training 
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on millions of words, new pairs and triplets of words will appear regularly even in data 
from the same source (say, the same newspaper from which the model was developed). 
This requires that the model be “smoothed” so that newly-appearing digrams and 
trigrams won't be assigned zero probabilities. 

N-grams are remarkably effective, given how simple the technique is and how non- 
specific it is to the domain of language. One could imagine models more specific to 
language, and in fact there has been a growing call for researchers to “put language 
back into language modeling.” This would require models that work with features 
other than small groupings of undifferentiated word tokens. Researchers have exper- 
imented with approaches such as decision trees and link grammars, but perhaps the 
most attention has been given to “maximum entropy” modeling (Berger, Pietra, and 
Pietra, 1996). Maximum entropy is another technique, like Markov chains, that has 
seen use in many domains — but its flexibility allows for the selection of many types 


of features, including those that are specific to language. 


Nitrogen 


The seminal system for the “overgeneration and ranking” approach is Nitrogen, from 
USC's Information Sciences Institute (Knight and Hatzivassiloglou, 1995; Langkilde 
and Knight, 1998; Langkilde, 2000). Nitrogen was developed in a machine translation 
context, working from Japanese to English. The input to the system, rather than 
being the fully explicit “messages” of Reiter and Dale's content determination stage, 
came from attempts at automated interpretation of Japanese newspaper texts. Given 


this, Nitrogen was designed to operate in an environment in which the semantic 
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input contained errors and ambiguities. Further, the domain of newspaper texts is 
so large that it would be difficult to create a lexical and grammatical knowledge base 
that would allow for principled decisions in all cases (a situation that leaves most 
traditional NLG systems falling back on defaults — e.g., using “the” as the default 
article). 

Nitrogen approached this problem by generating many possible texts, and then a 
language model was used to choose between them. At first the many potential versions 
of each sentence were represented as a “word lattice” — a set of many possible paths, 
diverging and converging, from the beginning to the end of the sentence (or, to put 
it another way, an “an acyclic state transition network with one start state, one final 
state, and transitions labeled by words”). The construction of this lattice didn't 
attempt to capture grammatical information such as agreement. So to get from “it” 


”» « » 
y 


to “not” there might be three paths: “am,” “are,” and “is” (even though only “it is 
not” is potentially correct). The language model would then be used to rank different 
possible paths through the lattice.? 

Nitrogen's language model was based on n-grams. Sometimes it was remarkably 
successful, and other times not. For example, here are the top six candidates it chose 
for one lattice (a random path through which results in: “Betrayal of an trust of 
them by me is not having a feasibility”): 

I cannot betray their trust . 


I will not be able to betray their trust . 


I am not able to betray their trust . 





“Later the lattices were replaced a “forest” structure (a packed set of trees) but the basic 
approach, representing many possible versions of a sentence, remained the same. 
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I are not able to betray their trust . 
T is not able to betray their trust . 
I cannot betray the trust of them . 

And here are the top six candidates Nitrogen chose for another lattice (a random 
path through which generates: “Visitant which came into the place where it will be 
Japanese has admired that there was Mount Fuji”): 

Visitors who came in Japan admire Mount Fuji. 
Visitors who came in Japan admires Mount Fuji . 
Visitors who arrived in Japan admire Mount Fuji. 
Visitors who arrived in Japan admires Mount Fuji . 
Visitors who came to Japan admire Mount Fuji . 

A visitor who came in Japan admire Mount Fuji . 

Despite these uneven results, Nitrogen struck a number of researchers as a promis- 
ing first step. And from our point of view it is interesting to see that structural and 
statistical approaches can come together in this way. It is certainly possible to imag- 
ine building a language model based on something other than newspaper text — 
something that might help choose sentences from forests of overgeneration that bet- 
ter match the intent of a particular creative project. And Nitrogen also demonstrates 
some of the the diversity of ways that the simple n-gram approach can be used within 


linguistic applications. 


Layers of contributions 


As noted above, Manurung chose a somewhat different path from Nitrogen's. With 


his evolutionary algorithm approach, he tells us, he produced no poetry. Or, as he 
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puts it: 


Note that the production of “genuine” poetry, i.e. texts which members of 
the general public can observe and with a high degree of certainty state 
are indeed poetry, is not the goal of this study. (p. 208) 


To some readers, especially literary critics and writers, this may seem an im- 
mensely odd statement to find in a dissertation titled An Evolutionary Algorithm 
Approach to Poetry Generation. What, then, is Manurung's contribution? 

The key here is the word that lies in the center of Manurung's title. His disserta- 
tion doesn't present a system that produces poetry, but it presents a new “approach” 
to poetry generation. As he does this, he takes some processes that hadn't previously 
been used toward literary ends, or perhaps even toward linguistic ends, and demon- 
strates that they hold promise for those attempting to think through new solutions 
to text generation. Nitrogen, similarly, demonstrates that a different type of process 
can be applied to text generation — that the task of generating text need not be 
shaped as is commonly assumed. 

These are creative acts. In time, the fruits of this creativity may include new liter- 
ary works built upon these techniques and further developments along these lines. If 
this happens, the “processes” of these works will need to be recognized as historically 
rich composites, embedding creative contributions from diverse people at different 
moments. 

And, as it happens, two chapters ago we developed a vocabulary for discussing 
creative contributions such as these. Manurung's work asserts that evolutionary al- 


gorithms can operate as a literary and linguistic logic, and he has created an imple- 
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mentation that provides an only partial demonstration. Those working on Nitrogen 
have defined a certain abstract process for combining linguistic logics from rule-based 
and statistical traditions. T'heir implementation, too, leads to an only partial demon- 
stration. In both cases, the partial demonstration may be enough to inspire further, 


perhaps more fully successful, work in the same direction. 


Sentences and stories 


It is no surprise that both Manurung and the creators of Nitrogen chose to explore 
statistical techniques for generating structured text, and no surprise that their suc- 
cesses were only partial. In our era statistical Al is ascendent: solving problems 
that once seemed uncomputable, finding more and more inroads into our daily lives. 
(When the keypad of a mobile phone attempts to predict what you're typing, to speed 
the process of producing the most commonplace messages possible, that is another 
instance of statistical AI.) But, at the same time, it remains persistently difficult to 
use statistical techniques for generating complex structures — such as the structures 
of sentences, and the structures of stories. 

For both sentences and stories, better results are still achieved by processes that 
are specified explicitly by humans, rather than learned through the statistical exam- 
ination of large pools of data. And the tools of traditional Al — both scruffies and 
neats — are powerful formulations for specifying such processes. Given this, such 
tools remain in use, but in a different environment. The successes of statistical Al, 
hard on the heels of “Good Old-Fashioned” AP's (GOFAT's) winter, have shaken to 


the core the belief that these tools can claim a special status. They are now simply 
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ways of building a system, almost like any other. And, in this environment, people 
have begun to regard the task of story generation quite diffterently — as we'll see in 
the next chapter. 

At the same time, even if the belief in a special place for GOFAT's tools has been 
profoundly shaken, other beliefs in the AI community have proven more difficult to 
disrupt. Specifically, despite the successes of statistical AI, many in the AI community 


remain committed to an anthropomorphized view of processes. 


Things That Think (Like Us) 


Given the history of Al, it is no surprise that systems such as Tale-Spin and Minstrel 
were built to embody models of human cognition, or that the author of Universe 
suggested that his system could offer insight into the human creative process. The 
assumption that human and machine processes should — or must — resemble each 
other runs deep in Al. It continues to this day, despite the counter-example of statis- 
tical Al. 

With Tale-Spin, Minstrel, and Universe all emerging from the “scruffy” end of 
symbolic Al, we might assume that this area of Al was particularly given to building 
its systems on human models. And perhaps it is true that a neat researcher would not 
have made Turner's opening assumption from his description of Minstrel: “To build 
a computer program to tell stories, we must understand and model the processes an 
author uses to achieve his goals” (1994, 3). And, in fact, scruffy researchers sought to 


claim for their approach greater fidelity with the way that humans approach problems. 
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For example, Meehan writes that, according to Schank and Abelson's theories, “there 
are several different techniques which people use in solving problems. A problem 
solver based on theorem proving, by contrast, might assume that there was, in essence, 
one uniform method to use in proving theorems, resolution, for instance” (1976, 29). 


Riesbeck and Schank argue similarly: 


The problem that general problem solver programs have is that each and 
every problem presented to them has to be handled the same way. Each 
problem must be reduced to first principles and logical deductions made 
from those principles. Even mathematicians do not really behave this 
way. (Riesbeck and Schank, 1989, 9) 


But to view the issue in these terms is to consider only one level. At another level, 
both scruffy and neat Al are based on the assumptions of cognitive science — espe- 
cially that effective problem solving for machines should be based on the same model 
of “symbolic processing” assumed to be at work in humans. As Paul N. Edwards 
puts it in The Closed World, “cognitive science views problems of thinking, reason- 
ing, and perception as general issues of symbolic processing, transcending distinctions 
between humans and computers, humans and animals, and living and nonliving sys- 
tems” (1997, 19). To put it another way, whatever their differences, scruffy and neat 
researchers still speak of their computer systems in terms of human-derived symbolic 
formulations such as beliefs, goals, plans, rules, and so on. 

Of course, both scruffy and neat GOFALI researchers have been extensively cri- 
tiqued by more recent thinkers. But many of the critics who locate themselves within 
Al view the problem with symbolic techniques as a lack of accuracy regarding how 


humans operate. For example, Philip E. Agre, in his book Computation and Human 
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Experience, characterizes symbolic Al's problems by saying, “Mistaken ideas about 
human nature lead to recurring patterns of technical difficulty” (1997, xii). His alter- 
native approach to Al is founded on a different view of human activity (p. 7) rather 
than abandoning the anthropomorphic model for Al processes. 

A more radical critique may appear to emerge from the current head of MIT's 
venerable Al laboratory, Rodney Brooks. In his widely-cited “Elephants Don't Play 
Chess” Brooks writes: “In this paper we argue that the symbol system hypothesis 
upon which classical AÍ is based is fundamentally flawed, and as such imposes se- 
vere limitations on the fitness of its progeny” (1990, 3). The alternative championed 
by Brooks is focused on Al that operates through combinations of simple mecha- 
nisms embodied in robots in real-world contexts (and focuses on interaction within 
these contexts). Nevertheless, after building a number of insect-like robots based on 
this research agenda, Brooks turned — inevitably? — to the creation of humanoid 
robots meant to eventually exhibit humanoid intelligence. The banner headline on 
the webpage for the most famous of these projects, Cog, reads, “The motivation be- 
hind creating Cog is the hypothesis that: Humanoid intelligence requires humanoid 
interactions with the world” (MIT Humanoid Robotics Group, 2003). The anthro- 


pomorphic view of Al asserts itself again. 


Simulation versus authorship 


From the point of view of authoring, there is something quite puzzling about all this. 
To a writer it is obvious that people in stories do not talk the way people do in real 


life, their lives as reported in stories contain only a tiny subset of what would go on 
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if they had real lives, and so on. A system that operated largely like a real person 
thinks might not be much help in telling a story — most real people aren't that 
good at storytelling (so simulating an author is out) and characters shouldn't act like 
real people (real people don't take the stylized, coordinated actions that dramatic 
characters perform). And, similarly, as discussed above, a great leap forward in 
statistical language generation might not be much help either, in that its models 
would doubtless be aimed at reproducing our culture's most pedestrian language. 

Accurate modeling is simply not what authors are after. This is true even in 
areas such as computer games. While first-person 3D games (such as Half-Life) may 
incorporate very natural-seeming models of space, movement, physics, lighting, and 
so on, each of these elements have been (along with each of the environments in which 
they are active) carefully shaped for the game experience. Half-Life may be more 
like the real world than is a game like Super Mario Brothers, but a deep model of the 
world, operating just like the real world, is (in addition to not being achievable) not 
an important step along the path to either one. 

At the same time, it may be difficult to imagine any other step forward for story 
generation. What guiding principle could there be, other than increasingly better 
models (symbolic or statistical) of the story creation and language generation pro- 
cesses? Universe seems like a first, if somewhat ambivalent, step — but along which 
path? FEschewing the author entirely seems the wrong direction. Relocating the 


author, the strategy adopted by the next chapter?s projects, is more promising. 


Expressive Intelligence 


Uniting Data and Process 


From a process-oriented perspective, systems such as Queneau's One Hundred Thou- 
sand Billion Poems and Strachey's love letter generator are trivially simple. They 
succeed — to what extent they do — because the simplicity is appropriate to their 
concept. On some level, the projects are about simple recombination. Further, the 
data employed by each work is shaped to work with these simple combinatorial pro- 
cesses, creating audience-oriented surfaces appropriate to each work's goals. 

In fact, most digital works are of this variety. They present combinations of data 
based on relatively straightforward processes, with the audience experience succeeding 
or failing on the strength of the data and its fit with the processes. Even if the 
3D animation engine driving a first-person computer game is immensely process- 
intensive, from the perspective of computer graphics, the appearance and dialogue of 
most characters (allies, enemies, bystanders) will be based on a collection of carefully- 
crafted data (image files, recorded lines of dialogue) sequenced in simple ways. The 


same is true of the world maps (levels), the objects that populate them, and so on. 
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Projects like Tale-Spin, Minstrel, and Universe, on the other hand, embody much 
more complex processes. But their potential power is limited by the design of their 
processes (which, as we have seen, is somewhat problematic) and by their data (which 
is even more problematic). Even if Tale-Spin was a more successful process, it would 
have to contain very wittily-conceived data to make us interested in stories about 
bears asking around after berries. Its data is, instead, a set of high-level representa- 
tions (so no cleverness is possible in the specific use of language) of a set of boring 
facts (combined with complex planning structures almost entirely hidden from the 
audience). Minstrel's data is largely absent. Universe's process and data only op- 
erate at the structural level — never specific enough to benefit from the skill with 
language of someone like Queneau. 

This chapter begins by looking at two projects that attempt to bridge the gap 
between carefully-crafted data and complex, expressive processes. Brutus and Ter- 
minal Time combine powerful structures that build on the work of earlier Al story 
generation systems with meaningful opportunities for writer-crafted data to shine at 
the surface. As we examine the second of these projects we will also begin to “fore- 
ground” the topic of interaction, reintroducing it into this study?s considerations 
(after it was shifted to the background in the introductory chapter). In addition, we 
will evaluate Michael Mateas's approach of “Expressive AI.” Its proposed role for the 
author is convincing, and its vocabulary of “authorial afftordances” and “interpretive 
aftordances” seems, like Chris Crawford's “process intensity,” to occupy a conceptual 
ground that can help our field move beyond unhelpful perceived divisions between 


the concerns of artists and scientists. 
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Brutus and Terminal Time 


The first publications about the Brutus and Terminal Time systems appeared in 
the late 1990s, and these two systems have continued to reach new audiences into 
the current decade. It may seem odd, given our discussion up to this point, to talk 
about story generation systems reaching audiences — but one of the features that sets 
this recent pair of story systems apart from those of the 1970s and *80s is a marked 
concern for audience experience. This is expressed not only through an increased 
focus on elements of discourse (rather than pure story structure) but also through 
active attempts to have the outputs of Brutus and Terminal Time experienced by 
audiences outside the Al research community. 

In addition to a greater concern with audiences, Brutus and Terminal Time are 
also set apart from previous work by their creators' attitudes toward the simulation of 
the behavior of human authors. Selmer Bringsjord and David Ferrucci, the creators 
of Brutus, actively argue that this is an impossible goal for AI. On the other hand, 
for the the view of Al that underlies Terminal Time — Michael Mateas's “Expressive 
AI” — the question of simulating human authors is simply beside the point. 

Despite these similarities, however, Brutus and Terminal Time take vastly differ- 
ent approaches to the challenge of story generation, serve widely different rhetorical 
goals for their creators, and vary greatly in the experiences made available to their 
audiences. As we examine these differences we will also be bringing our discussion of 


story and language generation to its final stage. 


324 
Brutus 


Given its name, it is probably no surprise that the Brutus system specializes in stories 


of betrayal. Here is the beginning of one: 


Dave Striver loved the university. He loved its ivy-covered clocktowers, its 
ancient and sturdy brick, and its sun-splashed verdant greens and eager 
youth. He also loved the fact that the university is free of the stark 
unforgiving trials of the business world — only this ¿sn a fact: academia 
has its own tests, and some are as merciless as any in the marketplace. A 
prime example is the dissertation defense: to earn the PhD, to become a 
doctor, one must pass an oral examination on one's dissertation. This was 
a test Professort Edward Hart enjoyed giving. (Bringsjord and Ferrucci, 
2000, 199-200) 


The story continues for roughly another half page. Hart is a member of Striver's 
dissertation committee, and when they meet in Hart's book-lined office he promises 
to sign Striver's dissertation at the defense. The story says that Hart's “eyes were 
bright and trustful, and his bearing paternal.” But at the defense Hart actually 
refuses to sign, and Striver fails. The story ends with an unexpected twist: we see 
Hart sitting in his office, “saddened by Dave's failure,” and trying to think how he 
could help him. We realize that Hart has lied to himself about his betrayal. 

Such examples demonstrate that, in terms of audience experience — both of lan- 
guage and structural coherence — Brutus's output goes far beyond any of the story 
generation systems we have considered thus far. And, in contrast with Minstrel, the 
creators of Brutus (Selmer Bringsjord and David Ferrucci) haven't presented its sto- 
ries as the result of an accurate simulation of human creativity. Instead, they have 


argued at length that such simulation is impossible, devoting much of their book 
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Artificial Intelligence and Literary Creativity (2000) to this issue. From Bringsjord 
and Ferrucci's point of view, the challenge of story generation is to develop a system 
through clever engineering, rather than through emulation of the mechanisms of hu- 
man creativity. As they put it, “we cheerfully operate under the belief that human 
(literary) creativity is beyond computation — and yet strive to craft the appearance 
of creativity from suitably configured computation” (p. 149, original emphasis). And 
their faith in the power of this approach is great. In particular, they talk of the 
Brutus architecture eventually producing “an artificial storyteller able to compete 
against inspired authors” (p. 152) or even able to “find employment at the expense of 
a human writer” (p. xxiv). This certainly sounds like a very successful “appearance 
of creativity.” 

Such statements have also proven a successful formulation for producing press 
coverage of Brutus. Journalists have written many stories that position Brutus as 
a potential competitor to human authors. The New York Times, for example, has 
published more than one story about Bringsjord and Ferrucci's work from this angle 
(e.g., Mirapaul, 1999; Akst, 2004). No other story generator has attracted this level 
of public attention over the last decade. In a particularly fitting confluence, one of 
the Times stories is actually about a literal competition between Brutus and a group 
of human authors. This competition, held on a website run by Dan Hurley, asked 
web browsers to judge five stories: four newly-written stories by human authors and 
one product of the Brutus;, implementation of the Brutus architecture. As the Times 


explains: 


Visitors to the site can vote on which story they think was created by a 
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computer. Hurley and his staff picked the four human-penned pieces after 
reviewing online entries from 390 contributors, who had a month to labor 
over their prose. Bringsjord said Brutus.1 spit out its story in seconds. 
(Mirapaul, 1999) 

The Times story does not report the outcome of the competition, and the website 
in question is no longer online. But just the existence of the competition, especially 
given its coverage in the U.S.'s paper of record, is telling. There is the impression 
— created by a combination of the literary quality of Brutus's output, the rhetorical 
stance of Brutus's designers, and the press coverage generated — that a major mile- 
stone has been passed. The key to this advance, we're told, is that the creators of 
Brutus have put their finger on something essential that storytelling systems need. 
As Bringsjord and Ferrucci put it, “a good artificial storyteller must be in command 
of the immemorial themes that drive both belletristic and formulaic fiction.... such 
themes must be cast in terms that an Al can digest and process; that is, these themes 


must be, for want of a better word, mathematized” (p. 81, original emphasis). 


Mathematized betrayal 


Brinesjord and Ferrucci see a need for “mathematizing” literary themes because they 
approach story generation from the “neat” perspective. As they write: “we approach 
story generation through logic; in more specific terms, this means that we conceive 
of story generation as theorem proving” (p. 42). To my knowledge, Bringsjord and 
Ferrucci are the first to create a major story generation system that employs the tools 
of theorem-proving Al. Specifically, Brutus is built using a logic-programming system 


called FLEX, which is based on the programming language Prolog. 
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relation betrayal_p 
if Evil is some goal whose plan is an EvilPlan 
and whose agent is a Betrayor 
and Saying is included in the EvilPlan 
and Saying is some say 
and Thwarting is included in the EvilPlan 
and Thwarting is some thwart 
and Betrayeds Goal is the prevented_goal of Thwarting 
and Betrayors_Lie is some support of the Betrayeds_Goal 
and Betrayed is some person 
whose goal is the Betrayeds_Goal 
and whose beliefs include the Betrayors_Lie 


Table 7.1: Brutus's Ideal Representation of Betrayal 


The first step in Brutus's story generation procedure is to instantiate a theme. 
As mentioned earlier, the literary theme that is the focus of the Brutus system is 
betrayal. In order to help us understand the system, Bringsjord and Ferrucci provide 
some “actual FLEX code” that is their mathematized expression of betrayal (table 
7.1, quoted from p. 173). This specifies two characters, the Betrayor and Betrayed. 
The Betrayed has some goal that the Betrayor promises to support, and the Betrayed 
believes the promise. But it is a lie, because the Betrayor actually has an evil plan 
to thwart the Betrayed's goal.* 

However, this is not the FLEX code used by Brutus; — the version of Brutus 
that Brinesjord and Ferrucci have actually implemented. Instead, they tell us, “In 
Brutus;,, thematic instantiation captures the essential characteristics of betrayal by 


building the following frame” (p. 190). This frame is shown in table 7.2. Notice that 





lNote that, given the story example above, it must be possible for the Betrayer to be consciously 
unaware of this evil plan, even after executing it. 
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frame betrayal is a concept 

default betrayer is a person and 

default betrayed is a person and 

default betrayers_goal is a goal and 
default betrayeds_goal is a goal and 
default betrayers_lie is a statement and 
default betrayers_location is a place and 
default betrayers_evil_ action is a action. 


Table 7.2: Brutus¡'s Implemented Representation of Betrayal 


it specifies less about the relationship and character internals (it does not specify 
that the Betrayed believes the Betrayer's lie, that the lie is regarding support of the 
Betrayed's goal, or that the Betrayer has an evil plan) and more about the events 
(not only a lie, but also a location, and also an evil action). 

Both of these theme instantiations, despite their differences, may remind one of 
Minstrel's planning advice themes. After all, PATs are also formal specifications 
of structures of relationship and event, which are meant to capture the immemorial 
themes that drive literature. But, as we”ll see, in the end the formal theme definitions 


of Brutus and Minstrel end up playing quite different roles in the two systems. 


Creating a story 


In order to tell a story about betrayal, Brutus needs to fill the roles of Betrayer and 
Betrayed. One might imagine the two characters being created on the fly, using some 
method akin to those of Universe or Tale-Spin — but constrained to ensure that 
each character will be appropriate for their role. Or, alternately, one might imagine a 


situation in which the system begins with a set of automatically-generated characters 
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(such as that in Universe after character generation). The system might then begin 
storytelling by choosing a character from this set with a goal that can be thwarted. 
It might then make that character the Betrayed, and then either create or assign a 
character to be the Betrayer. 

But neither of these is what happens. Instead, the system searches its “domain 
knowledge-base” — its memory — to see if two characters exist that already meet the 
criteria for the relationship of betrayal. That is to say, it looks for a character who 
already has what it takes to be the Betrayer (a goal, and a lie, and an evil action), 
and who takes part in appropriate events for betrayal, and where those events also 
include a character who has what it takes to be the Betrayed. 

The only way for the characters to get into the knowledge base is for them to be 
hand-created by humans. This means that Bringsjord and Ferrucci must write the 
FLEX code to represent a character who meets the criteria for being the Betrayer 
and to represent another who meets the criteria for being the Betrayed. However, 
Bringsjord and Ferrucci assure readers this sort of thing will change in a future version 


of the system: 


In Brutus;, thematic instantiation requires that the domain knowledge- 
base include many of the specific objects required to instantiate the theme. 
In future versions of Brutus the process of thematic instantiation will 
use general domain concepts as a basis for generating specific objects so 
that less domain knowledge is required as input into the story generation 
process. (p. 191) 
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Generating a plot 


Once Brutus has instantiated a theme and found the appropriate characters in mem- 
ory, the next stage is to develop the specifics of the plot. Bringsjord and Ferrucci 
characterize the plot development process as taking place through “simulation” — 
but it's a different form of simulation than, say, that which animates Minstrel”s model 
of authorial creativity. 

Bringsjord and Ferrucci tell us that the simulation begins with the fact that each 
character has a goal and a plan. They write, “Once a character is set in motion, the 
character attempts to achieve its goal by executing the actions associated with the 


goal's plan” (p. 177). So, for example, here is Hart's evil plan: 


instance evilGoal is a goal 
agent is hart and 
plan is (liel01, refuse_to_sign101) and 
success is status of strivers_defense is failed. 
(p. 178) 


Each action, such as “refuse_to_sign101,” has preconditions. These prevent it from 
being executed except in the right circumstances. We can assume, for example, that 
“refuse_to_sign101” only will execute when the aforementioned “strivers_defense” is 
the event taking place at that moment of the simulation. Here is the representation 
for that event: 


instance strivers_defense is a thesis_defense 
thesis is strivers_thesis and 
where is university_ofrome and 
examined is striver and 
committee is (hart,meter,rogers,walken| and 
status is scheduled. 
(p. 175) 
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In some ways this is similar to the Universe model, in which hand-coded plot 
fragments are strung together by the system's processes. But Universe chooses which 
plot fragment to use next based on a continually-shifting set of characters and goals. 
Depending on the circumstances, different characters can play different roles in the 
same plot fragment. On the other hand, for Brutus — or at least Brutus; — the 
specific characters are already encoded into each event. Also, each character must 
already have encoded into their plan the actions that they will take when a particular 
event is active. And, as we see on page 176 of Bringsjord and Ferrucci's book, 
even these actions are not generic, but each exist in a version hand-coded for the 
appropriate character. 

Given this, while the Brutus model of plot development can be characterized as a 
simulation, it's not a simulation that could lead to unpredictable outcomes. Rather, 
producing any plot requires hand-creating the specific characters, the specific plans, 
and the specific instances of events that will allow the theorem prover to produce a 
match on its logical pattern. The plot mechanisms that are at work in Brutus, then, 
sufter from the opposite problem of those produced by Minstrel. Rather than out of 


control variation, they seem to produce no variation at all. 


Structuring a story 


Once the story?'s events have been activated, and then added to memory, these events 
must be structured for their presentation to an audience. For this purpose Brutus 


uses a “story grammar” — a type of description of story structure first developed in 
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the analysis of stories, rather than their generation.? While Bringsjord and Ferrucci 
don't provide the grammar for the story quoted at the outset of this section, they do 


provide this “simplified” fragment: 


1. Story —> Setting + Goals_and plans + Betrayers evil action + be- 
trayed's_state 


2. Goals_and_ plans — Betrayed's_goal + Betrayers_promise + Betray- 
ers_goal 


. Setting —> setting_description(betrayal_location,pov,betrayed) 
. Betrayed's_goal —> personal_goal_sentence(betrayed) 
. Betrayers_goal —> personal_goal_sentence(betrayer) 


. Betrayers_promise —> promise_description(betrayer,betrayed) 


XA CD 0 R$ oO 


. Betrayers evil action —> narrate_action(betrayers evil action) 
(p. 196) 


It may not be immediately obvious how to read such a grammar. The rules are 
relatively simple. Each line defines something (on the left side of the arrow) that 
can be “rewritten” as something else (on the right side). If something is uppercase, 
then it is a “non-terminal” — there is another rule for rewriting it. If something is 
lowercase, a “terminal,” then it will be sent to the Brutus natural language generation 
grammars (which will be discussed in the next section). The items in parenthesis at 
the end of some terminals are arguments passed to the NLG grammars. 

This grammar is remarkably concrete. Most story grammars operate at a much 
higher level of abstraction. For example, a common terminal for many story grammars 


is a generic “event” — rather than a “personal goal sentence.” And many story 





2With a genealogy usually traced back to Vladimir Propp's Morphology of the Folktale. 
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grammars contain items that may be repeated, or which are optional — whereas 
each item in this grammar appears exactly once. This greater concreteness addresses 
one of the commonly-voiced concerns about story grammars: that, while a fine tool 
for analysis, they don't provide the necessary structure for generating well-formed 
stories. On the other hand, this grammar is so tightly structured that it is nearly a 
sentence-level outline. 


Here is a story generated from this grammar: 


Dave loves the university of Rome. He loves the studious youth, ivy- 
covered clocktowers and its sturdy brick. Dave wanted to graduate. 
Prof. Hart told Dave, “I will sign your thesis at your defense.” Prof. Hart 
actually intends to thwart Dave's plans to graduate. After Dave completed 
his defense and the chairman of Dave's committee asked Prof. Hart to sign 
Dave's thesis, Prof. Hart refused to sign. Dave was crushed. (p. 197) 


Notice that the first two sentence of this story have a very similar structure to the 
first two of the story quoted at the beginning of this section: “Dave Striver loved the 
university. He loved its ivy-covered clocktowers, its ancient and sturdy brick, and its 
sun-splashed verdant greens and eager youth.” In both cases, they were generated 
by the terminal of the third item in the grammar above: “setting description” — but 
operating with slightly different elements. It is here that we get into the specifics of 
how Brutus generates literary texts for its audience, and some of its most interesting 


contributions to the field of story generation. 


334 


Literary augmented grammars 


Just as Brutus uses story grammars, it also uses paragraph and sentence grammars. 
These are relatively standard linguistic tools, but in Brutus they are augmented with 
hand-crafted literary knowledge and named “literary augmented grammars” (LAGs). 
One particular LAG, which creates the second sentence in “setting description,” is 
the “independent parallel setting description” (INDPSD). 
Bringsjord and Ferrucci present the INDPSD grammar three times in the course 

of explaining the version used in “setting description.” The first presentation is of a 
relatively standard, if somewhat specialized, grammar: 

e INDPSD > SETTING verb FP 

e FP > its? FEATURE FP | “its* FEATURE 


e SETTING => noun phrase 
e FEATURE —= noun_ phrase 


(p. 181) 


In this grammar uppercase words are non-terminals, words in single quotes are 
“literals” (used in the sentence exactly as they appear), and lowercase words are 
terminals (to be selected /constructed from the lexicon). The “|” symbol, which did 
not appear in the story grammar, can be read as “or.” In this case, it indicates that 
an INDPSD can contain one or more FPs — because an FP can be rewritten either 
as “ “its” FEATURE” or as “ “its* FEATURE” followed by another FP. This creates 
the parallelism that gives the INDPSD its name. 

However, this grammar clearly is not enough to create a sentence such as “He loves 


the studious youth, ivy-covered clocktowers and its sturdy brick.” Where is Brutus 
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to find such features? While there are large bodies of “common sense” knowledge 
that have been encoded into structures such as the Cyc Ontology, common sense 
knowledge is not the same as literary knowledge. 

In order to address this, Brutus includes hand-encoded information about the 
“iconic features” of different objects, from a literary point of view. These iconic fea- 
tures are both positive and negative, to assist in portraying the object from differing 


points of view. Here, for example, is the listing of iconic features for universities: 


frame university is a object 
default positive_iconic features is 
(clocktowers, brick, ivy, youth, architecture, books, knowledge, 
scholar, sports) and 
default negative iconic features is 
(tests, competition, “intellectual snobbery”+. 
(p. 184) 


Obviously, this sort of literary knowledge is such that it would differ from story 
to story. In fact, 1t might differ from character to character within the same story. 
For example, a character who is an athlete and another who is her professor might 
disagree strongly about whether “sports” are a positive or negative iconic feature of 
the university. Nevertheless, Bringsjord and Ferrucci's approach is an intriguing one. 

Of course, this is still not enough information to produce a sentence such as the 
output we have seen from INDPSDs. Another level of knowledge is required, which 
Bringsjord and Ferrucci call “literary modifiers.” Here are the iconic features and 


literary modifiers for ivy: 
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frame ivy is a object 
default positive iconic features is (leaves, vines| and 
default negative_iconic_features is (poison) and 
default negative literary_modifiers is (poisonness, tangled) and 
default positive literary_modifiers is (spreading, green, lush). 
(p. 184) 


However, this still is not enough to ensure the creation of sentences such as those 
we have seen from INDPSDs. The grammar needs to be further augmented, with 
more consistency of structure and more direction about how to fill out aspects of 
the grammar. For this purpose Brutus contains what its authors call “literary con- 
straints.” Here is the second version of the INDPSD, incorporating a set of such 
constraints: 

e INDPSD > SETTING verb (isa possessive_verb) FP(n=3) 
e FP > 'its*? FEATURE FP | “its* FEATURE 
e SETTING — noun phrase (has role setting) 


e FEATURE — noun phrase (isa iconic_feature_of SETTING) 
(p. 185) 


In the grammar above, the literary constraints are the elements that appear in 
parenthesis. For example, the “(n=3)” in the first line enforces the occurrence of three 
FPs in the sentence — creating the three-part parallelism seen in both examples of 
the INDPSD we have seen. This is simply a structural rule. But elements such as 
the last line's “(isa iconic_feature_of SETTING)” create the connections between the 
grammar and literary knowledge. 

This, however, still doesn't bring us to the point that would create sentences such 


as those we have seen generated by INDP5SDs. Specifically, both of the examples above 
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begin with “He” — identifying Dave Striver. Throughout Artificial Intelligence and 
Literary Creativity Bringsjord and Ferrucci emphasize the importance of simulating 
consciousness at the level of language, of describing things from the point of view of 
characters. This brings us to the final version of this LAG: 

e POV > Agent (is a person) Verb (is a PC Verb) FirstFP 

e FirstFP > Setting FEATURE FP 

e FP > its FEATURE FP | “' 

e FEATURE => noun. phrase (is a feature of SETTING) 


e SETTING — noun phrase (is a setting) 
(p. 188) 


Some of the previously introduced constraints seem to have been removed in 
order to make it easier to see what is new in this version. In particular, Bringsjord 
and Ferrucci draw attention to the “(is a PC Verb)” constraint. They have hand- 
encoded a special set of verbs as “pc_verbs” which “include verbs for feeling, thinking, 
understanding, wanting, etc.” (p. 187). Presumably there is also a way in which the 
LAG restricts this so that we get a pc_verb that reflects the point of view of the 


Agent, but this is not specified. 


Brutus and creativity 


We can see that story grammars, LAGs, and literary knowledge are at the root 
of Brutus's successful natural language generation. The results are undoubtedly 
much more literary than those from systems such as Tale-Spin and Minstrel, and 


comparable with human-written stories. In this way Brutus successfully demonstrates 
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the power of solutions specially created for story generation — rather than oft-the- 
shelf bodies of knowledge or NLG techniques. 

On the other hand, the very approach made apparent by Brutus's story gram- 
mars, LAGs, and literary knowledge begins to undermine some of the rhetoric that 
has produced interest in the Brutus project — especially the rhetoric around its “ap- 
pearance of creativity.” A good summary of this rhetoric comes in Bringsjord and 


Ferrucci's chapter 6: 


The main goal behind the development of Brutus is to produce real, work- 
ing systems which, by virtue of their internal logical structure (which im- 
plements the architecture) and implementation specifics, allow for gener- 
ated stories to be sufficiently distant from initial, internal knowledge rep- 
resentations (called, again, creative distance) and to vary independently 
along different dimensions (called wide variability). (p. 161) 


We can recall from previous examples that systems such as Tale-Spin, Minstrel, 
and Universe — at some level — threaten to spin out of control. The unpredictable 
interactions between their components mean that their authors can be quite surprised 
by the outcomes (and not always pleasantly, as “mis-spun” stories attest). Bringsjord 
and Ferrucci usefully give a name to the gap in predictability that arises from these 
highly variable component interactions: “creative distance.” We might not choose to 
call this gap creativity, but it is a gap that certainly exists for these systems. 

On the other hand, it is also apparent that Brutus — or at least Brutus; — 
has almost no gap, almost none of this unpredictability, and almost none of the dis- 
tance from “initial, internal knowledge representations” that Brinesjord and Ferrucci 


discuss. Let's clarify this. Bringsjord and Ferrucci write: 
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If we look “under the hood” of a program and find it trivial for a human 
to transform the program's initial data to the program's output, we are 
less likely to consider the program creative. If however, we, as humans, 
would find it challenging to map the program's initial data to a creative 
artifact (like a well-written and interesting story), then we are more likely 
to consider the program creative. We call the perceived difference between 
a program's initial data and its output creative distance. (p. 161) 


It is not clear what level and type of human difficulty would be required to make a 
program appear “creative.” We may wonder, for example, if all statistical techniques 
(as soon as they operate on non-trivial amount of data) by definition appear creative 
— given that it would be nearly impossible for a human to carry out the calculations 
involved. But, regardless, in the case of Brutus we can see that even a cursory view of 
its knowledge representations (such as that performed above) easily allows a person 
to see what characters will take what actions during what story events — as well as 
what sentences in what order will be used to communicate the story. 

With all this so explicitly specified, what possible space could there be for “creative 
distance” in Brutus? Bringsjord and Ferrucci don't answer this directly, instead 
focusing on the criterion of variability. They write: “Significant storytelling and 
literary variability can be achieved by altering, adding, or selecting different story, 
paragraph, or LAGs. Content variability can, of course, be achieved by creating or 
modifying thematic relations, behavioral rules, and domain knowledge” (p. 197). In 
other words, one or more elements of an explicitly encoded Brutus story could be 
replaced with other, equally explicitly encoded elements — and, unsurprisingly, this 


would generate a different (though far from unpredictable) result. 
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The author of Brutus's stories 


If we accept Brutus as research into the creation of reusable story components, which 
can be interestingly combined with elements at other levels of abstraction, we can 
actually see much of value in it — though we will have to dispose entirely with 
the rhetoric of “creativity.” However, taking this perspective brings up another is- 
sue. In descriptions of Brutus one particular level of component is privileged: the 
mathematized description of literary themes. But thinking back over the different 
components presented for the Brutus; story outlined in Bringsjord and Ferrucci's 
book, it becomes apparent that this is the one unnecessary component. It is not used 
in creating characters, relationships, events, or language. All the elements of the story 
are defined independently of the formal account of betrayal. It is used only to match 
elements already in the knowledge base, which could be found by any number of 
other procedures (e.g., choosing a character at random, who will already be specified 
in all the events for his story, which in turn will already specify the other charac- 
ters involved). In other words, in Brutus the system's encoded knowledge about the 
structure of literary themes makes no contribution to “creative distance.” Instead, 
this distance is entirely created — to whatever extent it exists — by the shape of 
the pieces into which Bringsjord and Ferrucci decided to cut the story for which the 
formalized account would produce the logical match. 

This, in turn, might make us uncomfortable with reports, such as that from The 
New York Times, that “Brinesjord said Brutus.1 spit out its story in seconds.” In 
fact, Bringsjord and Ferrucci spent much longer than a few seconds specifying exactly 


what elements, and what versions of those elements, and what order of those elements, 
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would make up the story that “Brutus,” produced. In fact, presenting the Brutus; 
system as a story author simply seems unwarranted, now that we have examined its 
operations. 

Here, perhaps, we see the lingering impact of traditional views of Al. Even though 
Bringsjord and Ferrucci no longer claim that their system simulates human creative 
behavior, they still can't escape trying to create the impression that the system is 
the creative actor — that its stories are artworks, and the system is the author of 
these stories. Of course, this is not a necessary perspective. One could view the 
Brutus, system as an artwork, and its stories as particular surface outputs from 
the artwork, and Brinesjord and Ferrucci as the authors of the artwork. This would 
doubtless produce less press attention, but this is often the case with accurate reports 
of computing systems, and it would not make this perspective more difficult to hold. 
More substantially, this would require a different conceptual approach to Al — such 


as that which produced the Terminal Time project. 


Terminal Time 


The audience experience of Terminal Time is vastly different from that for the other 
systems we have discussed. Picture a darkened theater. An audience watches, pre- 
sumably somewhat disconcerted, as “a montage of Tibetan Buddhist imagery and 
Chinese soldiers holding monks at gunpoint” unfolds on screen. Á computerized 


voice tells them that: 


There were reports that Buddhist monks and nuns were tortured, maimed 
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and executed. Unfortunately such actions can be necessary when battling 
the forces of religious intolerance. (Mateas, 2002, 138) 

Underlying the words, one can hear a “happy, “optimistic? music loop.” It is 
uncomfortable and jarring. And, to make matters worse, the audience feels a certain 
sense of culpability. Terminal Time is not just a generator of uncomfortable stories, 
of distorted stories — it is also a generator of stories each audience “deserves.” 

The Terminal Time project is a collaboration between Al researcher /artist Michael 
Mateas, documentary filmmaker Steffi Domike, and media artist Paul Vanouse. Each 
story it generates is an ideologically-biased historical narrative of the previous mil- 
lennium, and each of these stories is presented as a 20 minute multimedia projection 
with “the “look and feel” of the traditional, authoritative PBS documentary” (Mateas, 
Vanouse, and Domike, 2000). The ideological bias that drives each story is shaped 
by audience responses — recorded by an applause meter — to public opinion polls 
that appear at three points during the performance. For example: 

What is the most pressing issue facing the world today? 

A. Men are becoming too feminine and women too masculine. 
B. People are forgetting their ethnic heritage. 

C. Machines are becoming smarter than people. 


D. Its getting harder to earn a living and support a family. 


E. People are turning away from God. (Domike, Mateas, and Vanouse, 
2003) 
The ideological model derived from audience responses is always an exaggerated 
one. It is a representation of the positions for which the audience has applauded, but 


taken to an untenable extreme. As it drives the selection of events to be recounted, 
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and the “spin” with which each will be presented, it — rather than reinforcing au- 
dience belief in these ideological positions — inevitably creates an ironic distance 
between the words being uttered and the message being conveyed. 

Yet Terminal Time stories aren't the result of a “mathematized” account of irony. 
They are the result of careful authoring. This authoring effort included the creation of 
a computational account of ideology — but while the system “knows” about ideology, 
the irony layered on top of the ideology is not represented in the system. Only the 
authors and the audience get the joke. 

And here we see what may be the most significant move that Terminal Time 
makes, relative the other story generators we have discussed. It reintroduces the au- 
thor(s) and audience as essential elements of fiction — through its emphasis on the 
context of reception (rather than only the generated text) and through interactions 
with the audience that generate the ideological model used in each presentation of 
Terminal Time. This reintroduction gives us reason to return to Marie-Laure Ryan's 
full model of fiction, rather than the partial “internal” model introduced in Chap- 
ter 5's discussion of Tale-Spin, and also to consider a manner in which Ryan's model 
may need to be deepened. Terminal Time points to this deepening by continuing 
to give an important place to the generator's internal representations and processes, 
using the framework of “Expressive AI.” And, at the same time, it turns our attention 
back to the moment of Tale-Spin's creation, to note a counter-tradition in Al that 
was present even before those early days of Schank's lab at Yale. 

We'll discuss these implications further as we ground ourselves in the specifics of 


Terminal Time's operations. 
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Computational ideology 


Each Terminal Time performance is divided into four sections. Section one is a two 
minute introduction which sets the audience's expectations — combining a “Master- 
piece Theater” style of delivery with the information that a historical narrative will 
be produced, for that audience, by Terminal Time's mechanisms. This is followed 
by the first question period, in which “an initial ideological theme (from the set of 
gender, race, technology, class, religion) and a narrative arc (e.g. is this a progress 
or decline narrative) are established” (Mateas et al., 2000). The next stage is the 
generation and presentation of the part of the story covering the years 1000 to 1750 
of the common era (C.E.), which takes six minutes. Following this, a second set of 
questions “refines the ideological theme chosen in the first set, and possibly intro- 
duces a sub-theme (e.g. combining race and class, or technology and religion).” The 
next section of story is then generated and presented, covering roughly 1750 to 1950 
C.E., and again taking six minutes. This is followed by a final set of questions which 
“further refines the theme(s) and introduces the possibility for a reversal (e.g. a de- 
cline narrative becoming a progress narrative).” This is followed by the generation 
and presentation of the last phase of the story, covering roughly 1950 C.E. to the end 
of the millennium. 

As each phase of storytelling takes place, the ideological models not only become 
further shaped by audience responses, but also more blunt in their operations. This, 
combined with audiences” greater familiarity with more recent history, causes the 
exaggerated ideological spin of the story to become steadily more apparent over the 


20 minutes of a Terminal Time performance. Towards the end of performances this 
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culminates in story fragments such as the glowing description of the Chinese invasion 
of Tibet quoted above. In that particular case, the ideological goals at work were those 
that Terminal Time's creators refer to as those of the “anti-religious rationalist.” 
The representation of ideology in Terminal Time is based on that developed for the 
Politics system (on which more below). Ideology is represented as a set of goal trees 
— specifically, rhetorical goals for what the story will demonstrate through its history 
of the millennium. While the initial audience polling produces one of the goal trees 
originally crafted by Terminal Times authors, further questioning may add, delete, 
or change goals. For example, during the second round of questioning a sub-theme 
may be introduced via the combination of goals from one tree with another. Below 
is the anti-religious rationalist goal tree as it exists before any modifications. Notice 
that, because “show-thinkers-persecuted-by-religion” is a subgoal of both high-level 


goals, it can satisfy both both of them. 


show-religion-is-bad 
show-religion-causes-war 
show-religion-causes-crazy-self-sacrifice 
show-religion-causes-oppression 
show-religion-causes-selfabuse 
show-thinkers-persecuted-by-religion 

show-halting-rationalist-progress-against-religion 
show-thinkers-opposing-religious-thought 
show-thinkers-persecuted-by-religion 

(Mateas et al., 2000) 


In their paper published in 2000, Mateas, Domike, and Vanouse write that in 
Terminal Time “[njine major ideologues are represented using a total of 222 rhetorical 


goals.” The authors of Terminal Time represent the crafting of these ideological 
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models as authoring, as the creation of an artwork. But this is not always the way 


that their work is understood in the Al community. As Mateas reports: 


The first time I presented Terminal Time to a technical audience, there 
were several questions about whether 1 was modeling the way that real 
historians work. The implicit assumption was that the value of such a 
system lies in its veridical model of human behavior. In fact, the archi- 
tectural structure of Terminal Time is part of the concept of the piece, 
not as a realist portrait of human behavior, but rather as a caricature of 
certain institutionalized processes of documentary film making. (Mateas, 
2002, 57-58) 


This reception of Terminal Time should, perhaps, come as no surprise, given the 
concerns of the Al field we have seen in our discussions of earlier story generators. 
And, in fact, it should also come as no surprise when we know something of the 
Politics system from which Terminal Time's ideological goal trees are descended. 
Developed by Jaime Carbonell* in Schank's lab, Politics is presented as “a set of 
closely cooperating computer programs that simulate human ideological understand- 
ing of international political events” (Carbonell, 1981, 259). In the Politics system 
goal trees are used to represent how a “conservative” and “liberal” ideologue interpret 
events such as the Panama Canal treaty. This may seem like the stuff of parody — 
as it is, to some extent, in Terminal Time. But, instead, Politics was created in an 
environment in which its creator could, with a straight face, use its operations to sup- 
port the argument that “building working programs is an integral part of formulating 
theories of human reasoning” (p. 270). 


And yet, as we'll discuss below, the picture is not quite that simple. The basis 





Carbonell would later serve as a co-chair of Mateas's dissertation committee. 
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of Politics was another system, created by Abelson, that is difficult to interpret 
as anything but a parody. First, however, let us look further at Terminal Time's 


operations. 


Event knowledge 


In order for Terminal Time events to be accessible to the system, they need to be 
represented in a formalized manner. The Terminal Time approach to this problem 
involves building on top of a representation of everyday knowledge (mentioned briefly 
in our discussion of Brutus) called the “Upper Cyc Ontology.” For Terminal Time's 
purposes, the approach taken by the Cyc ontology both structures how its own terms 
will be defined (as assertions in a knowledge base that also includes the terms from 
Upper Cyc) and provides the lower-level grounding on top of which its terms are 
defined. 

Terminal Time's historical events cover a range of levels of abstraction. They 
include, for example, the First Crusades, the invention of Bakelite, and the rise of 
Enlightenment philosophy. Here is the representation of one event, the Giordano 


Bruno story: 
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:: Giordano Bruno 
(Sisa %GiordanoBrunoStory %HistoricalEvent) 
(Sisa %GiordanoBrunoStory %IdeaSystemCreationEvent) 
(Sisa %GiordanoBrunoStory % Execution) 
(%circa %GiordanoBrunoStory (»DateRangeFn 

(»CenturyFn 16) (%»CenturyFn 17))) 
SeventOccursAt %GiordanoBrunoStory $ContinentOfEurope) 
SperformedBy %GiordanoBrunoStory %GiordanoBruno) 
SoutputsCreated %GiordanoBrunoStory %GiordanoBrunosldeas) 
Sisa %GiordanoBrunosIdeas S$PropositionallnformationThing) 
Sisa %GiordanoBrunosIdeas $SomethingExisting) 
%conflictingMOs %GiordanoBrunosIdeas %MedievalChristianity) 
Sisa %GiordanoBrunosIdeas Y%IdeaSystem) 
$performedByPart VGiordanoBrunoStory 
%TheRomanCatholicReligiousOrg) 
(SobjectActedOn %GiordanoBrunoStory %GiordanoBruno) 
(Mateas et al., 2000) 


( 
( 
( 
( 
( 
( 
( 
( 





In the above representation, terms preceded by a “$” are defined in the Upper 
Cyc Ontology, while those terms preceded by “%” are defined within the Terminal 
Time ontology in terms of the Upper Cyc Ontology. An English-language gloss of 


this event representation, provided by Mateas, Domike, and Vanouse, reads: 


The Giordano Bruno story, a historical event occurring in the 16th and 
17th century, involved the creation of a new idea system and an execution. 
The idea system created in this event conflicts with the idea system of 
medieval Christianity. Both Giordano Bruno and a portion of the Roman 
Catholic Church were the performers of this event. Giordano Bruno was 
acted on (he was executed) in this event. 


In order for a Terminal Time ideologue to make use of such an event, it must 
be possible to determine that the event can be “spun” to support one of the current 


rhetorical goals. The Terminal Time system identifies candidate events by testing 
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them for applicability. These tests are carried out through an “inference engine” 


written in Lisp. Here is the test for “show-thinkers-persecuted-by-religion”: 


(and 

Sisa ?event VIdeaSystemCreationEvent) 

Sisa ?event % Execution) 

SoutputsCreated ?event ?newIdeas) 

%conflictingMOs ?newldeas ?relBeliefSystem) 
(Sisa ?relBeliefSystem $Religion)) 

(Mateas et al., 2000) 


( 
( 
( 
( 


As Mateas, Vanouse, and Domike point out, this is not any sort of general test 
for finding all instances of thinkers being persecuted by religion. For example, it 
assumes executions are the only type of persecution. Similarly, Terminal Time's 
representation of the Giordano Bruno story is not the only possible one. The Terminal 
Time authors point out that, in other circumstances, 1t might be desirable to represent 
Bruno's writings and his execution as separate events, rather than one compound 
event. But, again, Terminal Time is not trying to create a realistic simulation of 
the behavior of historians, or create a system that “really understands” history, or 
be itself a “creative” system. Instead, Terminal Time is an authored artwork. If the 
authors desired — at some point — for the system to be able to identify examples 
of religious groups persecuting thinkers that do not involve executions, in order to 
employ these events in its stories, then the test could be broadened to match the new 
class of events. As of their 2000 paper, the authors report that the system includes 
“134 historical events and 1568 knowledge base assertions” (beyond those assertions 
in the Upper Cyc Ontology). Given that all the possible examples of events involving 


religious persecution of thinkers (among that 134) also include executions, a broader 
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test is not needed. 


Assembling the storyboard 


Events that make good candidates for the story are placed on the system's “story- 
board.” However, before being placed on the storyboard, events are “spun” by means 
of rhetorical plans. These select a subset of information available that relates to the 
event and lay out an order for its description. So, for example, the rhetorical plan 


for the goal “show-religion-causes-war” (which can satisty “show-religion-is-bad”) is: 


Describe the individual who called for the war, mentioning their religious 
belief 


Describe the religious goal of the war 
Describe some event happening during the war 
Describe the outcome 


(Mateas et al., 2000) 


A “spin” contains all the elements specified by a rhetorical plan, as well as in- 
formation about the rhetorical goal being satisfied (and all its parent goals). This 
information about rhetorical goals is necessary because the selection of events for 
each section of the story is performed via constraints, some of which handle events in 
terms of the rhetorical goals they serve. A number of these constraints come from the 
current Terminal Time ideologue. For example, here are the storyboard constraints 


for the anti-religious rationalist during the first six minute section: 
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%rhet-goal :show-religion-is-bad) 

%rhet-goal :show-religion-is-bad) 

%rhet-goal :show-religion-is-bad) 

%rhet-goal :show-religion-is-bad) 

%rhet-goal :show-halting-rationalist-progress) 

%and (Yorhet-goal :show-halting-rationalist-progress) 
(Vorhet-goal :show-religion-is-bad)) 

(Mateas et al., 2000) 


ATA... - 


This determines that there will be six events in this section's representation on the 
storyboard, which serve the specified rhetorical goals. In a sense, there are six event 
“slots.” There is not yet any order to these slots, however. Order is created by using 
an ideologue's “rhetorical devices.” These devices create the connections between 
events — and associated with each device is a set of constraints on the events that 
can appear before and after it. For example, here is a rhetorical device from the “pro- 


religious supporter” ideologue (a counterpart to the “anti-religious rationalist” ): 


(def-rhetdev :name :non-western-religious-faith 
:prescope-length 2 
:prescope-test (:all-events-satisfy (Jand 
(Sisa ?event %HistoricalSituation) 
(:kb (SeventOccursAt ?event %FirstWorld)) 
(Vrhet-goal :show-religion-is-g00d))) 
:postscope-test (:some-event-satisfies ?spin (Yand 
(Sisa ?event %HistoricalSituation) 
(kb (SeventOccursAt ?event %NonFirst World)) 
(Vrhet-goal :show-religion-is-g00d))) 
:nlg-rule :generate 
:nlg-context-path (:non-western-religious-faith)) 
(Mateas et al., 2000) 


The “prescope” test specified for this device requires that both of the previous 
two event spins occur in the First World and satisfy the rhetorical goal of showing 


that religion is good. The “postscope” test requires that the immediately following 
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event also satisfy the rhetorical goal of showing that religion is ggod — but take place 
somewhere other than the First World. When this rhetorical device is used in story 
generation it calls an NLG rule to create the connection between events. In this case 
the rule is quite simple, resulting in the pre-written sentence “The call of faith was 
answered just as ardently in non-western societies.” 

To summarize, Terminal Time assembles the storyboard for each section of its 


story as follows: 


1. First, it finds the events that can be spun to support the current ideologue's 


rhetorical goals, and makes them into “spins.” 


2. Next, spins are added to the storyboard (as an unordered collection, only some 
of which will be used). Constraints on the storyboard (such as those from the 
current ideologue) determine how many events, and serving what rhetorical 


goals, will actually be used in each section of the generated story. 


3. Finally, Terminal Time identifies a set of rhetorical devices that can connect the 
right number and type of events (to meet the storyboard constraints) searching 
using the events currently available on the board (and needing to meet the 


internal constraints imposed by each device's prescope and postscope tests). 


Presenting the story 


Once the storyboard for a portion of the story is assembled, the collection of spins 


and rhetorical devices is sent to the NLG system. This system follows a set of rules 
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for generating both the English text of the story and a set of keywords. (Unfortu- 
nately, the published information about Terminal Time is not detailed enough to 
provide a concrete example of its NLG rules in action.) A text-to-speech program 
is used to generate a narrative voiceover for the story, lending it an unmistakably 
“computerized” tone. Keywords are used to select sequences of digitized video that 
will be played during the narration, and these are accompanied by music. As of 2000, 
the system contained 281 rhetorical devices, 578 NLG rules, and a video database of 
352 annotated 30 second clips. 

Terminal Time is always presented in a theatre, before a live audience. Usually, 
it is presented twice for the same group, with each performance followed by a brief 
discussion with one or more of Terminal Time's authors — resulting in an overall 
audience experience of roughly one hour. As of their 2000 paper, Mateas, Vanouse, 
and Domike report that Terminal Time had been presented in “14 venues, including 
the Carnegie Museum of Art, the Warhol Museum, and as the Walker Museum's 


entry in Sonic Circuits.” 


Terminal Time and audiences 


There are a number of results that come from the authors presenting Terminal Time 
twice, in quick succession, to the same audience. One, quite obviously, is the clear 
reintroduction of the author and audience into the Terminal Time model of fiction 
— putting it in line with Ryan's model. However, another effect is to foreground 
the fact that this model is not fully adequate when working with story generators. 


At each performance, Terminal Time generates two quite different narratives of the 
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same millennium. In doing so, it makes clear that it is (in the terminology of Mateas's 
“Expressive AI”) both a “message from” and “messenger for” the author. It not only 
presents the possible world in which there is a machine that creates historical doc- 
umentaries (which could be accomplished by a traditional fiction) and presents two 
different narratives created by this machine (possible worlds within worlds are cer- 
tainly a feature of traditional fiction) but makes it clear that this machine actually 
exists, and operates, and could produce a larger number of fictions than that audi- 
ence could possibly sit through. This maneuver, this establishment of the fact that 
Terminal Time is not only a message but an operating messenger, could be compared 
to the difference between writing the Borges story “The Garden of Forking Paths” 
and actually constructing the labyrinth novel described within it. Conceptually they 
are both very much the same and widely distinct. The method of Terminal Time's 
presentation brings home this distinction, and makes clear that we must somehow 
deepen Ryan's model if it is to be used to discuss story generators as fiction. 
Another impact of dual presentations of Terminal Time is that it allows the 
audience to change their relationship with its interface. Jay David Bolter and Diane 
Gromala, in their book Windows and Mirrors, point out that, even in one viewing, 


Terminal Time provides its audience with a dual experience: 


As spectators, we experience a more or less transparent movie.... As 
participants in the voting process, however, we are very conscious of the 
interface, and we are meant to reflect on our participation in the vote 
— in particular, on the notion that our ideology is being tested. The 
experience is reflective. (2003, 134-135) 


Presenting Terminal Time twice creates a dual reflection. During the first showing 
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audience members can reflect on the voting process, on the resulting story, and on 
Terminal Time's dual performance and parody of the notion of highly customized 
media. But only after the first showing is complete does it become possible for 
the audience to fully reflect on their own complicity in the very structure of market- 
research-style polling that provides the only means of interaction with Terminal Time 
— and decide to stop “playing along with” and instead start “playing against” this 
expectation. Both my own experiences as a Terminal Time audience member (at the 
1999 Narrative Intelligence symposium and at SIGGRAPH 2000) and the reports of 
the Terminal Time authors point to the importance of this shift between showings. 


As the Terminal Time authors write in the book Narrative Intelligence: 


Typically, during the first performance, audiences respond to the ques- 
tions truthfully, that is, actively trying to reflect their true beliefs in their 
answers to the questions. During the second performance they tend to 
respond playfully to the questions, essentially trying on different belief 
systems to see how this will effect the resulting history. (Domike et al., 
2003) 


Of course, the dual showings also serve to allow the audience to begin to form a 
mental image of how the Terminal Time model of ideology drives the documentary- 
creation process. From there it becomes possible for the audience to reflect on the 
gap between this and how (in their view) ideology shapes the documentary-creation 
process of human filmmakers. This can be seen as another of Terminal Times inver- 
sions of the history of Al — a gap in its simulation of human behavior that is not a 


failure, or opportunity for future work, but an opportunity for reflection and debate. 
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Beyond neat and scruffy 


Another interesting reversal in the Terminal Time project, from our point of view, 
is that it combines together a variety of Al approaches drawn from both the “neat” 


and “scruffy” traditions. As Mateas puts it: 


Terminal Time makes use of formal reasoning techniques, specifically a 
form of higher-order logic programming (an implementation of [Elliott éz 
Pfenning 1991]) (neat), to engage in ideologically biased reasoning about 
historical events (scruffy), where historical events are represented as col- 
lections of higher-order predicate calculus assertions (neat) written in an 
ontology based on the Upper Cyc Ontology [Lenat 1995] (Cyc being a 
scruffy instantiation of the neat logicist common sense reasoning agenda), 
all within the context of an audience-interactive exploration of ideological 
bias in representations of history (scruffy). (Mateas, 2002, 59) 


On one level it is hard to argue with this approach. Terminal Time undoubtedly 
creates stories with a more impressive combination of variability, strong structure, 
and expressive presentation than any of the other story generators we have discussed. 
On another level, Mateas takes quite a gamble in pointing out the heterogeneity of 
his approach. Such an approach to the system's architecture risks seeming ad-hoc, 
almost incoherent from a scientific viewpoint. That is, unless there is an alternative 
reasoning that explains this combination in a productive way, that defines an area of 
Al research that can encompass both the scruffy and the neat. 

Mateas's “Expressive Al” is an attempt to define just such an area of Al research. 
Or, perhaps more precisely, to define a new area of inquiry that includes an approach 
to Al research that can encompass both neat and scruffy techniques. He does this by 


introducing the author and audience into the conception of Al generally — not just 
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story generation systems. He writes: 


In Expressive Al the focus turns to authorship. The Al system becomes 
an artifact built by authors in order to communicate a constellation of 
ideas and experiences to an audience. If classical AÍ builds brains in vats, 
and interactionist AI builds embodied insects, then Expressive AI builds 
cultural artifacts. The concern is not with building something that is in- 
telligent independent of any observer and her cultural context. Rather, 
the concern is with building an artifact that seems intelligent and /or alive, 
that participates in a specific cultural context in a manner that is per- 
ceived as intelligent and/or alive. Expressive Al views a system as a 
performance of the author's ideas. The system is both a messenger for 
and a message from the author. (Mateas, 2002, 63) 


Certainly this is a significantly different articulation of the role of Al than we 
have seen behind any of the previous story generators we have examined — and one 
that justifies the use of both neat and scruffy techniques. Though it is also true that, 
in this framework, it seems that the use of almost any techniques could be justified. 
In fact, it becomes an open question, instead, why one would necessarily use Al 
techniques at all, or call the results “AI.” 

From this, two intertwined questions open up. First, is Expressive Al actually AT? 
Second, is Expressive Al anything new? The next section will attempt to address 


these, and the broader question of authorial intelligence. 


Evaluating Expressive Al 


Mateas offers a concise outline of Expressive Al in the introduction to his dissertation: 


Expressive Al has two major, interrelated thrusts: 
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1. exploring the expressive possibilities of AI architectures — posing 
and answering Al research questions that wouldn't be raised unless 
doing Al research in the context of an art practice, and 


2. pushing the boundaries of the conceivable and possible in art — cre- 
ating artwork that would be impossible to conceive of or build unless 
making art in the context of an Al research practice. 


(Mateas, 2002, 2) 


It is easy to see how both of these thrusts could flow from the idea of an Al system 
as a message from, and messenger for, an author communicating with an audience. 
But, again, we face the questions mentioned in the previous section. Isn't the field 
of Al dedicated to systems that are intelligent, not just ones that do something that 
appears intelligent which is really a message from an author to an audience? And 
isn't the creation of a digital media artifact that is actually a message from an author 


to an audience pretty old hat? Let's take the second of these questions first. 


Al as hypermedia 


In the mid-1970s two influential books were published, each defining a quite differ- 
ent vision of digital media. One book was Nicholas Negroponte's Soft Architecture 
Machines (1976) and the other was Ted Nelson's Computer Lib / Dream Machines 
(1974). Negroponte, who founded MIT'"s highly influential Media Lab, envisioned 
personal computers as artificial intelligence-driven hyper-personalized assistants. It 
was suggested that one's relationship with one's computer would become so intimate 
— Characterized by such adaptive intelligence on the computer's part — that it would 


be difficult to even use another person's computer. Nelson, on the other hand, en- 
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visioned things quite difterently. He dreamed of systems no more “intelligent” than 
power tools or movie cameras — and just as effective for artistic creation when placed 
in trained, talented hands. Nelson's overall category for responsive media created 
through authorship was “hyper-media,” encompassing hypertext (of various sorts), 


hypergrams, hyperfilms, and so on. He describes them in this way: 


Hyper-media are branching or performing presentations which respond 
to user actions, systems of prearranged words and pictures (for exam- 
ple) which may be explored freely or queried in stylized ways. They will 
not be “programmed,” but rather designed, written, drawn and edited, 
by authors, artists, designers and editors.... Like ordinary prose and pic- 
tures, they will be media; and because they are in some sense “multi- 
dimensional,” we may call them hyper-media, following mathematical use 
of the term “hyper-.” (p. DM18) 


Here we can see that Mateas's goals are certainly not new. A focus on digital 
media authorship has been with us for decades. On the other hand, a system such as 
Terminal Time would be quite difficult to author following a vision such as Nelson's, 
which seems to imply that each possible combination is “prearranged” by the system's 
author(s).* It is only through the application of policies for creating arrangements, 
rather than simple selection between pre-authored arrangements, that systems with 
the expressive power of Terminal Time can be created. 

Which is to say that Terminal Time also turns traditional AI practice on its 
head. It takes tools that were developed as representations of human intelligence — 


but which critics argued were nothing more than idiosyncratic collections of rules, 





1%And prearrangement has been the strategy of most hypermedia authors, using tools ranging 
from Hypercard to Storyspace to Director to Flash. However, some research has also taken place in 
areas such as “adaptive hypermedia” employing tools with an Al heritage. 
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producing intelligent-seeming results only in very limited circumstances — and em- 
ploys them precisely to construct idiosyncratic collections of rules, meant only to 
drive the behavior of a single work of digital media. In other words, Expressive AI 
uses the tools of Al to bring the goals of hypermedia to a place that could not be 
reached by normal hypermedia approaches — a place of greater process intensity. 
Seen in this way, Expressive Al is not entirely novel from the perspective of either 


field, but marks a potentially significant turn in both. 


Al as intelligence 


The rhetoric around story generators — from Tale-Spin to Brutus — generally focuses 
on the idea that the system itself exhibits something like intelligence or creativity. 
The author who creates the system and the audience that experiences it are generally 
eftaced. By stepping away from this formulation, by insisting on the context of audi- 
ence and author, is Expressive Al also stepping away from Al? That is, is Expressive 
Al only worth considering as a branch of the field of hypermedia (though using Al 
tools) and not as a branch of the field of AI? Mateas addresses this concern in his 


dissertation as follows: 


To the degree that Al research is committed to the functionalist program 
of reading the operations of Al programs as identical to what takes place 
in human beings, it may seem that Expressive Al, by consciously avoiding 
this identity and focusing on representation, expression, and interpreta- 
tion, is no longer Al, that is, can no longer claim to contribute to the field 
of AI. However, I do believe that Expressive Al continues to be Al re- 
search, and that, in fact, Expressive Al, by explicitly viewing Al systems 
as procedural representations rather than models in the strict sense, is 
doing what Al has really been doing all along, rather than what it some- 
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times says it has been doing. While over the years various influential AI 
researchers have claimed that Al systems truly model human intelligence 
(often accompanied by overly optimistic estimates of when general human 
intelligence will be achieved), if you look at what Al researchers actually 
do, and how the experience of constructing and running their systems re- 
sults in new ideas and new system construction, Al programs really look 
more like thought experiments than empirical experiments, explorations 
rather than models. Schank certainly describes the work in the Al lab at 
Yale in this light... (Mateas, 2002, 195) 


Of course, there is still quite a bit of difference between saying that a system is a 
“thought experiment” regarding human cognition (as Schank might) and saying that 
it has no goal of relationship to human cognition, and instead is a piece of expressive 
media (as Mateas does). Given this, it may seem that Mateas is trying to finesse a 
problematic relationship with the history of Al by claiming it isn't problematic at all, 
and in fact is how things have always been, when the evidence simply is not present 


to support his case. 


An ATI counter-tradition 


However, in fact we can locate evidence to support Mateas's assertion relatively 
easily — but by looking at the history of the work that informs Terminal Time, 
rather than at the less than entirely applicable quotations Mateas provides from 
Schank, Daniel Dennett, and others. Remember that Terminal Time's ideological 
goal trees are modeled on those employed by the Politics system developed by Jaime 
Carbonell in Schank's lab. While the operations of Politics were described as an 
attempt to “simulate human ideological understanding” the project was also seen as 


an updating of an earlier project of Abelson's — the Goldwater Machine — using 
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Schank and Abelson's new theories. As Carbonell puts it, “Politics was originally 
conceived as an effort to reformulate the Goldwater Machine in light of the new theory 
of knowledge representation” (Carbonell, 1981, 260). 

But while Politics was not presented as a parody, it is hard to see the Goldwater 
Machine in any other way. Even Carbonell describes it as “spewing ultra-conservative 
rhetoric in response to statements about international political issues” (p. 259). But, 
despite its apparent parodic stance, the Goldwater Machine was also seen as an impot- 
tant research contribution — with its “master script” as one of the first formulations 
of the ideas that would become the Schank/Abelson triumvirate of “scripts, plans, 
and goals.” And the Goldwater Machine's “spewing” is far from the only parody 
created in early Al research. Joseph Weizenbaum's Eliza/Doctor, mentioned in an 
earlier chapter, was conceived as a parody of a Rogerian psychotherapist. But, at the 
same time, Eliza/Doctor was also regarded as an important research contribution. 

In fact, one could trace a sort of counter-tradition within Al, closely intertwined 
with its mainstream, that has created systems meant not as simulations of human 
intelligence but as procedural messages (often humorous) from an author to an au- 
dience. Far from being dismissive of such work, the AI mainstream has often viewed 
it as offering significant research contributions. If we were to trace this counter- 
tradition, and observe how its contributions have been integrated into the designs 
of systems represented as true simulations of human behavior, it seems likely that 
another question might come up: Is Mateas right after all? If the design of a parody 
can become the basis for a system presented straight faced, are Al figures such as 


Schank tipping their hands when referring to their work as thought experiments? Has 
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it really been Expressive Al all along, even if another rhetoric was necessary (perhaps 
for the sake of funders, or for scientific respectability)? 

It is likely that no single, stable answer to these questions exists. But, in any case, 
the Expressive Al attitude has clearly been present, at least as a nameless counter- 
tradition, from the early days of Al research. Given that it has also proven a source 
of innovations found useful by the Al mainstream, giving it a name and formalizing it 
as a research agenda seems perfectly appropriate. At the same time, for our purposes, 
the counter-tradition to which it points can't help but complicate our view of early 
projects such as Tale-Spin. How much, unspoken, did the Expressive Al agenda 


already dwell within the conception of earlier story generation projects? 


Al and art 


Of course, even formalizing Expressive Al as a research agenda within Al doesn't go 
as far as Mateas does in his outline quoted at the beginning of this section. He names 
Expressive AP's thrusts as “exploring the expressive possibilities of AI architectures” 
and “pushing the boundaries of the conceivable and possible in art.” This is more 
than establishing a sub-field of Al focused on art. This is the establishment of an 
independent field to which both Al and art contribute. 

This is different from the usual “art and technology” approach, in which the 
common question is, “What use can artists make of the products of science?” Mateas 


is quite clear in differentiating his agenda, writing: 


Albbased artwork raises research issues for which Al scientists not only 
don't have answers, but have not yet begun asking the questions. Fur- 


364 


ther, the artist's conceptual and aesthetic exploration is not an indepen- 
dent “driver” of the research, providing the specifications for the technical 
work, but rather is deeply intertwined with the technical work; concep- 
tual, aesthetic, and technical issues mutually inform each other. (Mateas, 
2002, 5) 


What's to be gained by such a conception? One answer is: more successful 
projects. The comparison of Terminal Time with other story generators seems to 
establish it. Freed from the demands of the wider Al field, and informed by concerns 
from the arts, Terminal Time actually manages to generate a variety of stories, rather 
than a variety of non-stories (Tale-Spin) or whatever one story is currently encoded 
(Brutus). It also manages more than the technical feat of story generation, creating 


a project of artistic and social relevance. 


Expressive Al”s affordances 


But there is also a more general answer to the question of what is gained by this 
outlook. By pursuing art and Al together, Mateas has also produced a formulation of 
the concerns of each field that is legible to both groups. Specifically, Mateas has coined 
the terms (and concepts) of “authorial affordance” and “interpretive affordance.” 
Recall, from early in this study, the questionable (though not without some basis) 
stereotypes of writers and computer scientists. Among the elements of surface, data, 
and process, writers were presented as almost entirely focused on the surface — while 
computer scientists were presented as almost entirely focused on the processes. To 
put it another way, from the viewpoint of this stereotype writers are almost entirely 


concerned with the audience experience of digital media, while computer scientists 
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are almost entirely concerned with the internal system design (and perhaps how 
it performs relative certain “content-free” benchmarks). These do not seem like 
promising viewpoints to attempt to bring together. Mateas's terminology, however, 
sidesteps the limitations of these stereotypes, casting focus on surface and process in 
terms that highlight the shared concerns of artists and computer scientists from an 
Expressive Al viewpoint. 

The concept of “affordance,” as applied by Mateas, is one brought into the discus- 
sion of human-computer interaction by Donald Norman (though it originated with 
psychologist J. J. Gibson). In introducing the term in his book The Psychology of 
Everyday Things, Norman writes: “When used in this sense, the term affordance 
refers to the perceived and actual properties of the thing, primarily those funda- 
mental properties that determine just how the thing could possibly be used” (1988, 
9). 

Here we begin to see the common ground. First, let us look at “authorial affor- 


dances.” 


Artists and writers, of course, ¿nevitably have some appreciation of non- 
surface aspects of the computational systems they use. Parts of the system that 
aren't those seen by the audience determine what it is like to author using these 
systems. And when the emphasis, through Expressive Al, turns to the system as a 
means of communication between author and audience, a major systems focus be- 
comes the appropriate design of structures that capture and assist the expression of 


author intention. These are authorial aftordances. Mateas explains the term this 


way: 


The authorial affordances of an Al architecture are the “hooks” that an 
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architecture provides for an artist to inscribe their authorial intention in 
the machine. Different Al architectures provide different relationships 
between authorial control and the combinatorial possibilities offered by 
computation. Expressive Al engages in a sustained inquiry into these au- 
thorial affordances, crafting specific architectures that aftord appropriate 
authorial control for specific artworks. (Mateas, 2002, 125-126) 


“Interpretive aftordances,” naturally, are the other side of the coin. They are 
the hooks the system makes available to an audience to aid in the interpretation of 
the system, its actions, and its possibilities. While a concern with the audience is 
certainly familiar to artists, the concept of interpretive afftordance places an empha- 
sis, within that concern, on how the system acts to make these interpretive hooks 
available. Similarly, while concern for a system's “output” is certainly familiar to 
many computer scientists, this concept places an emphasis on how those outputs 
will be interpreted — something that can't be measured with the sorts of tests for 


“correctness” found in a simple debugging procedure. 


Expressive Al”s poetics 


Along with these pieces of vocabulary Mateas also offers a poetics: a recommenda- 
tion that authorial and interpretive affordances be considered together and closely 
matched. “The architecture is crafted in such a way as to enable just those authorial 
affordances that allow the artist to manipulate the interpretive affordances dictated 
by the concept of the piece. At the same time, the architectural explorations suggest 
new ways to manipulate the interpretive affordances, thus suggesting new conceptual 


opportunities” (p. 127). Mateas describes the closely matched authorial and inter- 
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pretive affordances of Terminal Time, but let us take Mateas's poetics and attempt 
to apply them to our first example of story generation: Tale-Spin. 

From an artistic standpoint, a major problem with Tale-Spin is that its most 
interesting fictional operations (elaborate speculations and decision making processes) 
are not exposed to the audience except in minimal reports of their outcomes. (While a 
trivial movement from one place to another results in a laborious report of every stage 
through which it passes.) This does seem like a case of a mismatch between authorial 
and interpretive aftordances, if one considers the creation of Tale-Spin's planning 
structures as a work of authoring. On the other hand, if one takes these structures as 
a given, and focuses on the creation of characters and worlds as Tale-Spin's authoring, 
then Mateas's conception is of little help. From this it is apparent that (for Mateas's 
poetics to hold) a balance must be struck between the operations of the system as a 
whole and the available interpretive affordances, not between the actions available at 
a limited “authoring interface” (such as that of Tale-Spin's question-asking modes) 
and the audience experience. 

Similarly, Mateas's vocabulary does not offer us help in addressing the other major 
artistic problem with Tale-Spin: its lack of higher-level story structure. Nevertheless, 
it is an extremely useful set of vocabulary — casting the concerns of internal processes 
and surface outputs in terms that both artists and computer scientists can embrace. 
Hopefully, given this, these terms will also in time be embraced by digital literature's 


critical community. 
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Toward Interaction 


Mateas's use of “affordances” departs somewhat from that in most discussions of 
human-computer interaction (HCI). In mainstream HCI the most common use of 
the term is to discuss the ways that systems let their users know what actions are 
possible — in what ways the systems can be used. Mateas considers afftordances for 
supporting interaction a sub-type of interpretive affordances. 

Up to this point, this study has discussed afftordances for interaction very little. 
But certainly, even with Terminal Time (which only allows interaction during three 
short interruptions in its 20 minutes of storytelling) the audience experience would be 
quite different without interaction. Picture a version of Terminal Time that, rather 
than asking the audience questions, simply displayed the result of a random process 
that reconfigured the internal ideologue at each of the three breaks. Instead, Terminal 
Time pivots on the notion of giving the audience a history they “deserve.” 

Which is to say, with the introduction of authorial and interpretive affordances 
— with the reintroduction of the author and audience performed by Terminal Time 
after our temporary suspension of their presence beginning with the discussion of 
Tale-Spin — we have come to the end of what we can discuss about process without 
dealing more directly with concepts of interaction and context. The chapter following 
this one will briefly address these topics, which form an important part of the future 
work I plan to pursue in this area. 

Here, however, we will consider two questions of interaction raised by this chapter's 


discussions. 
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Authors and audiences 


The characterization, earlier in this chapter, of Tale-Spin's question-asking modes 
as an “authoring interface” (page 367) is arguably inaccurate — and in a particular 
manner that Mateas's terminology can help tease out. Recall the discussion of Tale- 
Spin's three modes (page 245). The first two modes fill in much of Tale-Spin's data 
about the world by asking the audience. This determines what characters there 
are, what other objects are in the world, which characters know about the objects, 
which character is the main one, the nature of the main character?s problem, how 
the characters feel about each other, and so on. The third mode, on the other hand, 
allows an author to determine all these facts about the world before the simulation 
starts — determining, for the audience, what story they will experience. 

This fact — that the story-determining data can be provided by the audience 
or the author — may remind us of a question in which there has been sustained 
interest for more than a decade: Does digital media blur the distinction between the 
author and the audience? The most famous argument in the affirmative is George 
Landow's from the first edition of his book Hypertext (1991). Landow, in the book's 
first chapter, lays out two ways that “hypertext blurs the boundaries between reader 
and writer” (p. 5) when using systems such as Intermedia (the experimental hyper- 
text system, developed before the web's explosion, that provided the majority of the 
examples for Landow's book). First, readers choose their own reading path, defining 
the text's linearity for themselves. Second, and more substantially, a system such as 
Intermedia functioned in a way that, on the web, has begun to reemerge in spaces 


such as wikis (the underlying form on which projects like Wikipedia are based). As 
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Landow puts it “a full hypertext system ... offers the reader and writer the same 
environment” (p. 7). Readers can make new links in the text they are reading, can 
create their own new nodes in the same system as what they are reading, and, if given 
sufficient privileges, can edit the text of other writers.? 

Critics of this argument have tended to focus, however, only on the first, less 
significant blurring.? Marie-Laure Ryan, for example, in Narrative as Virtual Reality, 


writes: 


To the skeptical observer, the accession of the reader to the role of writer 
— or “wreader,” as some agnostics facetiously call the new role — is a 
self-serving metaphor that presents hypertext as a magic elixir: “Read me, 
and you will receive the gift of literary creativity.” If taken literally — but 
who really does so? — the idea would reduce writing to summoning words 
on the screen through an activity as easy as one, two, three, click.... Call 
this writing if you want; but if working one's way through the maze of an 
interactive text is suddenly called writing, we will need a new word for 
retrieving words from one's mind to encode meanings, and the difference 
with reading will remain. (2001, 9) 


Here we see the loaded categories of author and reader confusing what is actually 
a quite simple fact: As soon as interaction is introduced into a system, the audience 


is providing elements of the work that would otherwise, in order to produce the same 





5This is clearly stated in a Communications of the ACM article about Intermedia: 


Each Intermedia document opens in its own window and any document can be edited 
by any user with appropriate privileges. In addition to the usual features offered by 
direct-manipulation text and graphic editors, Intermedia users can create and follow 
links between selections in any document presented by the file browser. (Haan, Kahn, 
Riley, Coombs, and Meyrowitz, 1992, 40) 


SLandow, on pages 325-330 of Hypertezt 3.0 (2005), demonstrates how Espen Aarsetb's critique 
of this argument in Cybertert (1997) fails to engage with the second, and more substantial, sense in 
which systems like Intermedia blur the distinction between reader and writer. 
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behavior without interaction, have to be provided by the author. In Tale-Spin this is 
the data about the characters and world. In a read-only, node-link hypertext system 
(such as those about which Ryan is writing) this is only the order of the texts. In a 
read-write, node-link hypertext system (such as those about which Landow is writing) 
this also includes the contribution of new links, the contribution of new texts, and 
the editing of those materials already present. 

But to stop at this point of analysis is to miss something crucial: Mateas's au- 
thorial affordances. Tale-Spin's question-asking modes extract character and world 
data from the audience in a particular manner, and an audience member encounter- 
ing Tale-Spin for the first time would have little idea what the consequences of their 
choices might be. Tale-Spin*s world-fixing mode, on the other hand, is unavailable 
to the audience, requires a very different structuring of data, and requires that the 
author be well aware of the system's design. We may decide to call both of these 
authorial aftordances, but there is no danger of confusing the two experiences if we 
view them through this lens. 

Turning to hypertext, the affordances for ordering chunks of text by clicking on 
links produce an experience that is quite fluid. We can also observe that the results 
of this selection are transitory — it produces only the current reading. On the other 
hand, the affordances for ordering chunks of text in a word processing program are (for 
the general use patterns of mainstream programs) anything but fluid, with elaborate 
cut-and-paste operations necessary to move from one ordering to another. The results, 
however, are lasting. Between these two extremes lies one of the earliest hypertext 


concepts: that of the “path” or “trail.” Vannevar Bush's 1945 description of this idea 
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directly inspired early hypertext proponents such as Theodor Holm (“Ted”) Nelson 
and Douglas C. (“Doug”) Engelbart. In common usage, a hypertext path is a directed 
set of links that creates one possible linear order through part of an interconnected 
web of material — which is lasting, which can be shared, and which both authors 
and readers can create in some hypertext environments (though the web, by default, 
makes it available to neither). 

Returning to the Intermedia environment, there, as on a wiki, the authorial and 
interpretive affordances available to all users (with sufficient privileges) are the same. 
The system stops differentiating the roles of author and audience, only distinguishing 
the acts (available to both groups) of interpreting, creating, and navigating material. 
From here, the interesting question becomes what emerges in various communities 
as people are able to move between the two roles for the same set of documents.” 
Against this background, we see the limitations of Ryan's position, which fails to 
engage the specifics of the systems being used by those who formulated the arguments 


she dismisses. 


Interaction warrants processes 


Having, in this chapter, begun to bring the audience and interaction out of the back- 
ground, our perspective on expressive processing shifts. With a view that includes 


the audience perspective, it may seem that Terminal Time is the great exception 





See, for example, recent debate about whether Wikipedia should continue to allow pages to be 
edited by readers who don't register (that is, who won't take on a public, if potentially pseudony- 
mous, identity within the Wikipedia community). 
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among the projects discussed here. It may seem that only Terminal Time actually 
requires computational processes. 

Consider, for example, Brutus. A human author could easily write a Brutus,, 
story for posting on a website or printing in a newspaper. The same is true of 
Umiverse. A human author could produce a plot outline for scriptwriters to turn 
into a document that would guide the shooting of a serial melodrama. Whereas, for 
a Terminal Time story to be realized as a newly-scripted documentary with video 
and voiceover, in seconds, for a live audience, we must have some mechanism that 
operates more quickly than a human author. 

And this perception, that interaction warrants Terminal Time's use of processes, 
is far from unfounded. Consider the possibilities opened up by interactive story gener- 
ation which operates quickly and can be experienced by many people simultaneously. 
A story could be customized not only for a single audience member exploring a virtual 
narrative world, but potentially for thousands — for millions — of audience members 
able to also interact with each other in the same world. While work of this sort is 
only starting to take shape, the important point is that, unlike non-interactive story 
generation, it would be impossible to achieve the same effect by applying only human 
authoring effort. 

But, at the same time, to focus only on interaction would disguise the insights 
gained from the earlier chapters of this study. Tzara's radical newspaper poem and 
Hartman's conservative “Monologues” might appear much the same to us, if we fo- 
cus only on the appearance of random-seeming blocks of text. Or, recall Mateas's 


assertion, quoted earlier in this chapter (page 346), that “the architectural structure 
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of Terminal Time is part of the concept of the piece.” Even though Mateas argues 
that a pieces authorial and interpretive affordances should be closely matched, the 
system that moves most directly from the first to the second is not the goal. The 


operations of the work — its processes — are an important part of any work that 


includes the definition of procedures. 


Expressive Processing 


This study has devoted much of its attention to projects arising from, or in some 
cases later characterized as, artificial intelligence. The primary goal, however, is not 
to provide a history or critique of the Al field. Rather, this study investigates the 
authoring and interpretation of expressive processes. 

To put it another way, Al works are not simply of interest because they are Al, 
and also not simply because they are process intensive. After all, image rendering and 
database search can also be process intensive. Rather, this study has taken up many 
Al projects because they have semantically-focused process intensity. Al processes 
are designed to mean something. 

On the other hand, as we have seen in Mateas's need to coin and defend the 
concept of “Expressive Al,” disciplinary Al focuses on projects intended to mean 
something in a quite abstract sense. Adding back the author and audience is a 
radical step, which places Expressive Al in a potentially troubled relationship with 
disciplinary Al. 


We can see another example of this trouble by remembering the project of “Socially- 
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Situated AI.” Just as Mateas's Expressive AI moves Al in the direction of hypermedia, 
Socially-Situated AI moved in the direction of human-computer interaction (HCI). 
The originator of the term, Phoebe Sengers, rose to some prominence while pursuing 
these ideas (e.g., named one of Lingua Franca's “top 20 researchers changing the way 
we think about technology” in 1999). But only a few years later Sengers abandoned 
disciplinary Al, and began to identify as an HCI researcher. The group she now 
directs at Cornell University is titled the “Culturally Embedded Computing Group.” 

From an authoring perspective, we gain nothing from disciplinary battles. The 
question of whether expressive, semantically-intense processes should be called “AT” 
is of little interest, no more than a potential distraction. Similarly, we have no reason 
to marginalize approaches that don't emerge from an Al tradition. For all these 
reasons, while Mateas's terms for authorial and interpretive aftordances seem worth 
adopting, “Expressive AI” does not seem the best term to use for the larger practices 
that interest us here. These are the factors that have led this study, instead, to the 
term expressive processing for work that explores the potential of semantically-intense 


procedures. 


Al as inspiration 


Finally, it is also worth noting a particularly interesting result of the expressive pro- 
cessing perspective. From this point of view, the approaches of past decades” research 
projects in areas such as story generation and natural language generation become of 
interest for more than historical reasons. Rather than failed models of “real” intelli- 


gence or general linguistic competence, they can now be seen as part of a menu — 
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one made up of inventive approaches from which future digital authors may choose to 
adapt techniques (e.g., Minstrel's TRAMs, Brutus's LAGs) for use in crafting their 
procedural messages. 

Seizing this opportunity will require a new generation of digital writers, able to 
engage both a work's processes and surfaces. Such writers will need to have enough 
technical savvy to employ the tools of Al as semantic processes, but not be caught up 
in the failed dream of a machine that makes art autonomously. They will need to have 
the artistic vision and commitment to develop concepts appropriate for procedural 
work and author the large amounts of content necessary for their realization, and yet 
not be so attached to any particular configuration of words or imagery that they pull 
back from the full potential of processes. We have no way of knowing if a significant 
group of artists of this sort will ever come to exist, but my hope is that this study 


will have helped highlight an idea that one or more of them finds useful. 


Conclusions and Future Work 


A Deeper View of Processes 


Now that we have arrived at the conclusion of this study, this chapter has two goals. 
One is to summarize some of the conclusions about processes that are the results 
of this study's investigations. The other goal is to briefly “foreground” some of the 
elements of the model for digital media suggested in the first chapter that have been 
thus far peripheral. This will begin a discussion of some of the wider implications for 
digital literature, building on the work of this study, that will form a basis for future 
work. 

My primary hope for this study is that it has contributed to a deeper discussion 
of the role of process in digital literature, and thereby digital media more generally. 
Most fundamentally, I hope it has convincingly argued — through example — that 
the processes of digital works are deserving of our attention. 

In general, our field has focused almost exclusively on the surfaces and interactions 
that are available to audiences of digital media. This has led to a large amount of 


fruitful work. But it also has led to misunderstandings and missed opportunities in 
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the interpretation of many works. Further, it has prevented critical work from being 
as fully in dialogue with authors as it might be (given that digital media authors often 
work on defining processes as a primary undertaking) and meant that critical writing 
on digital literature has had very limited applicability to attempts to understand other 
process-intensive phenomena (e.g., non-literary simulations of human behavior). 

Here I have looked at both the processes that define particular works (e.g., Stra- 
chey's generator) and the types of processes developed by entire fields of endeavor 
(e.g., Natural Language Processing). In a way this is similar to how readings of 
surfaces attend both to the language of individual works and how that language is 
situated relative larger constellations such as genre. However, the parallel between 
the study of processes and the study of surfaces soon breaks down — given that the 
study of surfaces is so much better developed. In time I hope the parallel can be pur- 
sued more robustly, as work on the interpretation of processes comes to be pursued 
by a diversity of scholars with a diversity of approaches. 

In the meantime, here are some of the specific contributions I hope this study has 


made: 


e Á set of example interpretations of expressive processes. These range from the 
first work of digital literature (Strachey's love letter generator) through a set 
of historical examples (e.g., Tzara's newspaper poem, the Oulipo's “N + 7”) 
and culminate in a critical survey of the field of story generation. In each 
case the specific operations of the processes were examined, compared with the 
operations of related processes, and considered within the context of their initial 


creation and reception. 
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e A clearer set of concepts for discussing the internals of digital literature and dig- 
ital media. Not simply data and process, but also the more fine-grained model 
of process-oriented contributions described with these terms: implemented pro- 


cesses, abstract processes, operational logics, and works of process. 


e Ásense of what is brought into focus, and what is marginalized, when comparing 
processes. Specifically, this study considered comparisons of processes carried 
out by authors, audiences, and automatic computation — and, within this, the 


differing forms of indeterminacy as viewed from each perspective. 


e Á consideration of some of the culture of computation, from the perspective of 
digital literature. By taking the forms of literature, and by being clearly au- 
thored artifacts, works of digital literature can provide legible examples of ideas 
in circulation in the wider fields of computation. Here we've looked at works 
that reflect “big dreams” of automatic authorship, simulating human behavior, 
and a computational universal language. We've also used digital literature, and 
particularly fiction, as a focusing mechanism for looking at the broad currents 
in fields such as artificial intelligence and natural language processing. And we 
began by using digital literature to drive a clarifying survey of what we mean 


by terms such as “digital” in the first place. 


With this behind us, it's time to look — briefly — at some of the important 


aspects of digital literature that this study did not consider in depth. 
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Figure 8.1: The first chapter's model of digital media 


A Wider View of Digital Literature 


The remainder of this chapter will foreground some of the elements of the model 
of digital media that were placed in the background in the first chapter. These 
elements, as seen from the perspective of this study's more developed understanding 
of processes, will be considered with a view toward my future work in this field. 
Here, again, are the primary elements of the first chapter's model (in addition, its 


image appears again as figure 8.1): 


1. Author(s). Those that create and select the data and processes. We've seen, 
in examples such as n-gram text, how the definition of a work's processes often 
takes place as an addition to a rich composite of previous contributions — and 
in Tale-Spin a clear case of something that is always present in some form: 


process definition being intertwined with a community's beliefs. 
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2. Data. This includes text, images, sound files, specifications of story grammars, 
declarative information about fictional worlds, tables of statistics about word 


frequencies, ontologies of common-sense knowledge, and so on. 


3. Process. The procedures actually carried out by the work. Data and process are 
the major site of authoring for works of digital literature, though not all works 
define processes (e.g., email narratives need not). Within processes we can 
usefully discuss the phenomena of implemented processes, abstract processes, 


operational logics, and works of process. 


4. Surface. The surface is what the work turns to its outside. This encompasses 
what the audience experiences — what the work makes available for inter- 
pretation and interaction (including instructions to the audience). This also 
encompasses any parts of the work that connect with outside processes and 


data sources. More on surfaces will appear later in this chapter. 


5. Interaction. This is change to the state of the work, for which the work was 
designed, that comes from outside the work. For example, when an audience 
member reconfigures a combinatory text (rather than this being performed by 
the work's processes) this is interaction. Similarly, when the work's processes 
accept input from outside the work — whether from the audience or other 
sources. Audience interaction often, by convention rather than necessity, results 
in an immediate additional change to the surface produced by the state change 


in the work. More on interaction will appear later in this chapter. 
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6. Outside processes and data sources. Many works of digital media (and some 
works of digital literature) operate while drawing on dynamic sources of outside 
information, updating outside sources of information, or otherwise interacting 


with outside processes and data. 


7. Audiences(s). People who experience the work's surface. Audiences may be 
members of a small research community or members of the general public. 
They may be presented with fixed system outputs or interact with the system 
during its operations. There may be elements of the system (e.g., a Tale-Spin 
story's character and world data) that are sometimes provided by audiences 


and sometimes by authors. 


These are, of course, categories that are far from rigid — as well as deeply de- 
pendent on each other. To take an example, supporting particular interactions is, 
of course, dependent on displaying to an audience an interactive surface (in con- 
text) generated by appropriate processes and data. The dependencies run in every 
direction. 

Speaking of context, as in the first chapter, it is present for each element but 
not indicated in the diagram. Visible or not, it is essential to our understanding. 
As we have seen, works such as Strachey's love letter generator and Meehan's story 
generator might be rather opaque without an appreciation of the context within which 
they were produced and first received. 

Having given this overview of the model, this chapter will now concentrate on two 


previously-backgrounded elements: interaction and surface. 
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Interaction 


Interaction has been one of the primary foci of those writing about digital literature 
— and digital media generally. Unfortunately, the term has often been used in such 
vague ways that some scholars (such as Espen Aarseth) have abandoned it as useless. 
I respectfully disagree, but wish to employ the term in something closer to its mean- 
ing in computer science, as seen in the distinction between “interactive” and “batch” 
processes. That is, I define interaction from the perspective of the process. This is 
broader than defining interaction from the audience perspective — audience inter- 
actions become a subset of interactions, though certainly a particularly interesting 
subset. 

One could, of course, proceed differently. For example, only audience interactions 
might be considered interactions, with those from other sources being considered 
under a different name. The specific way that elements are divided into categories, 
however, does not strike me as the issue of primary importance. Rather, the important 
issue is that we give attention to these elements, regardless of how we group them 
into higher-level categories. 

And as we begin to think about interaction, from process or audience perspectives, 
it turns out to be useful to return to one of the figures that loomed large at the 


beginning of this study — Alan Turing. 
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Turing test vs. Turing machine 


While the term “Turing machine” is quite famous in computer science circles, in pop- 
ular culture Alan Turing's name is more often associated with the so-called “Turing 
test.” 

Turing, however, actually proposed a game, rather than a test. In an article in 
the journal Mind (1950) he proposed the “imitation game.” This game has three 
participants: “a man (A), a woman (B), and an interrogator (C) who may be of 
either sex.” During the course of the game the interrogator asks questions of A and 
B, trying to determine which of them is a woman. A and B, of course, do their 
best to convince C to see it their way — the woman by telling the truth, the man 
by “imitation” of a woman. The proposed game was to be played over a teletype, 
so that nothing physical (tone of voice, shape of handwriting) could enter into C's 
attempt to discern the gender of the other players based on their performances. 

Turing then asked, “What will happen when the machine takes the part of A 
in this game?” That is, what will happen when a machine, a computer, tries to 
“pass” as female — rather than a man attempting to pass in this way — under the 
questioning of the human, C? Turing proposed this as a replacement for the question, 
“Can machines think?” ! 

Turing's paper is important for a number of reasons. One, as Nick Montfort 


has pointed out (Wardrip-Fruin and Montfort, 2003, 49), is simply that it proposed a 





Though, in common usage, the term “Turing test” usually drops the imitation game and gender 
aspects of Turing's description — focusing instead on whether a human judge believes a textual 
conversant to be human or machine. Artworks such as Mark Marino's Barthes's Bachelorette and 
Greg Garvey's Genderbender 1.0 playfully recover some of the lost aspects. 
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linguistic, conversational mode of computing at a time when almost everyone thought 
of computers as number crunchers. For philosophers, the primary audience of Mind, 
it provided a much more specific, phenomenological formulation of the “problem” of 
machine intelligence. For the not-yet-born field of Artificial Intelligence it provided 
inspiration. But for our purposes it provides something much more basic: an early, 
clear instance of digital media conceived as an interactive experience. 

Remember, from the very beginning of this study (page 23), that Turing machines 
give us a way of thinking about what is computable — that is, what questions can 
we pose and receive an answer? But, as Peter Wegner and others have pointed out 
in recent years, much of the computing we do each day is not of this form. Rather 
than a posed question to which we receive (or fail to receive) an answer, interactive 
computing assumes an ongoing set of processes, accepting and responding to input 
from the outside world, and in some cases (e.g., airline reservation systems) with any 


ending considered a failure (Wegner, 1997). Or, as Wegner puts it: 


Claim: Interaction-machine behavior is not reducible to Turing-machine 
behavior. 


Informal evidence of richer behavior: Turing machines cannot handle the 
passage of time or interactive events that occur during the process 
of computation. 


Formal evidence of irreducibility: Input streams of interaction machines 
are not expressible by finite tapes, since any finite representation can 
be dynamically extended by uncontrollable adversaries. 


That is to say, there is a real, definable difference between a program like Stra- 
chey”s love letter generator and a program for playing the imitation game. The gen- 


erator runs and produces an output, using its data and processes, and is completely 
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representable by a Turing machine. But to play the imitation game requires data, 
processes, and an openness to ongoing input from outside the system that results in 
different behavior by the system — interaction — something for which at least some 


computer scientists believe a Turing machine is insufficient. 


Forms of interaction-focused digital literature 


Once interaction becomes part of our picture, new areas open up for our attention. 
For example, we can turn to some of the most important forms of digital literature: 
hypertext (and hypermedia), interactive characters, interactive fiction/drama (and 
role-playing games), and so on. At the same time, a more formal view of interaction 
gives us the opportunity to divide the field of digital literature based on the form and 
role of computation in each work. This alternative is less familiar, and results in a 
different organization of the field — which may disturb our normal thinking on these 


issues in productive ways. 


Hypertext and hypermedia 


In literary circles, the best known form of digital work is hypertext, thanks to the 
efforts of critics and educators such as George Landow (especially the three editions 
of his book Hypertert, 1991; 1997; 2005) and writers such as Robert Coover (who has 
put his literary weight behind hypertext in such high-profile venues as The New York 
Times, 1992; 1993). As was briefly mentioned in the previous chapter, hypertext 
is the name for textually-focused types of hypermedia. Both the terms “hypertext” 


and “hypermedia” were coined by Theodor Holm (“Ted”) Nelson, who in a 1970 


387 


article outlined two particular types of hypertext: “discrete” hypertext (also known 
as “chunk-sytle” hypertext) and “stretchtext” — as well as other hypermedia forms 


” and “queriable illustrations.” 


such as “hypergrams,” “hypermaps, 

Discrete hypertext is by far the most famous kind of hypertext. It involves “sep- 
arate pieces of text connected by links.” The early working examples of this sort of 
hypertext were created by two teams in the 1960s. One was a team led by Douglas C. 
(“Doug”) Engelbart at Stanford Research Institute (Engelbart and English, 1968), 
which also made famous such innovations as videoconferencing and the computer 
mouse. Another was a team led by Andries (“Andy”) van Dam at Brown University 
(Carmody, Gross, Nelson, Rice, and van Dam, 1969), where Nelson traveled up from 
New York to collaborate, which initiated decades of activity in this area at Brown (in- 
cluding the work of Landow and Coover, as well as this dissertation). While dismissed 
for years by partizans of other technologies, the structure-creating and navigation- 
enabling power of the discrete hypertext link is now beyond debate. The explosive 


growth of the World Wide Web — from research project to global phenomenon in 


less than a decade — has laid the matter to rest.? 





2Unfortunately, hypertext's high profile has also made it a target for critics who may feel that 
other forms of digital literature are not receiving their due. One strategy taken by such critics has 
been to attempt to limit hypertext's scope, so that it can be made clear how much of interest lies 
outside it. In doing so, however, they move against the grain both of the term's original meaning 
and its current use in computer science. 

So, for example, Nelson has tended to define “hypertext” with broad phrases such as his 1970 
formulation: “ “Hypertext” means forms of writing which branch or perform on request” — which 
includes discrete hypertexts, stretchtext, and more. In the 1980s Nelson offered even broader defini- 
tions, which did not necessarily require interactivity, such as the definition “nonsequential writing” 
in the 1980s Computer Lib / Dream Machines (1987). On the other hand, critic Marie-Laure Ryan, 
whose work on possible worlds and hypertext has been discussed earlier, limits hypertext to a par- 
ticular type of discrete hypertext, writing: “In hypertext ... the reader determines the unfolding 
of the text by clicking on certain areas, the so-called hyperlinks, that bring to the screen other 


388 


From our perspective there are two particularly interesting things about the dis- 
crete hypertext phenomenon. First, its power as a means of interaction can be har- 
nessed by processes that are almost trivially simple (as with early web browsers and 
servers). Second, it is presumably only one “sweet spot” among the many poten- 
tial types of hypermedia, which we are beginning to explore more generally with the 
mainstream availability of tools like Flash.* 

As discussed in the previous chapter, a defining characteristic of work on hy- 
pertext/media — as opposed to that which has taken place in Al — is a focus on 
authorship. In practice this emphasis has produced tools and approaches that have 
been more successfully employed by artists and writers than any other digital media 
forms. The results include the canon of literary hypertext (such as that published 
by Eastgate Systems), more media-centered explorations (such as those published by 
The Voyager Company), and most interactive and installation art (much of which, 
until recently, was created with the tool Director — which also enabled most of Voy- 
ager's publications). Examples of all of these have now moved online, of course, in 
forms such as web pages employing client-side scripting, Flash files, Java applets, 


sever-side databases, and so on. 





segments of text” (2001, 5). In a similarly limited vein, Espen Aarseth writes that, “Hypertext, for 
all its packaging and theories, is an amazingly simple concept. It is merely a direct connection from 
one position in a text to another” (1994, 67). As a result of these severely narrow definitions, such 
critics are sometimes forced into absurd intellectual positions. Aarseth, for example, suggests in the 
same essay that the best-known hypertext novel — Michael Joyce's Afternoon — is not a hypertext, 
but rather “a cybertext disguised in hypertext's clothing” (p. 69). This is due to Afternoon's use of 
conditional links, a feature of the Storyspace hypertext authoring environment that lies beyond the 
bounds of Aarseth's limited hypertext definition. 


3Though, as a tool for authoring structured hypermedia, Flash lacks many of the capabilities 
of earlier standards that emerged from the hypertext community, such as HyTime (DeRose and 
Durand, 1994). 
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In all of these forms, most hypertext/media projects have continued to focus 
on what artists and writers do well — produce successful surfaces through careful 
crafting of data. The process vocabulary of most of these works is quite small, but 
that does not limit the effectiveness of the work. After all, films and codex books, 
for example, mainly have very similar forms of system behavior and user interaction, 
but differing data produces a variety of user experiences. 

However, not all critics have understood the wide variety of potential experiences 
possible based on the use of discrete hypertext links, and some have chosen to focus 
on links as a means of creating one literary experience — that of exploration. We can 
understand, of course, why it seems true that the link-based hypertext interaction 
of systems such as Storyspace lends itself to exploration-based fiction (given that we 
have so many examples of artists successfully using Storyspace to create works that 
employ links in an exploratory way). But we also have some evidence that quite 
different “locative media” technologies (such as those used in Teri Rueb's /tinerant) 
are good platforms for exploration-based fiction, and link-based hypertext has shown 
itself effective for utterly different experiences of fiction (such as in Scott McCloud's 


“Carl Comics” ).* 





*Ttinerant (2005) is a site-specific sound installation in Boston, Massachusetts. It invites people 
to take a walk through Boston Common and surrounding neighborhoods to experience an interactive 
sound work delivered via handheld computer and driven by GPS satellite information. During a walk 
which may last for more than two hours, visitors explore the area, finding fragments of a personal 
narrative of family and displacement, interspersed with passages from Mary Shelley?s Frankenstein 
— the classic tale of a technoscientific monster and the family love he witnesses voyeuristically, 
but cannot share. It is an exploration-based narrative, but there are no links to click. “The Carl 
Comics,” on the other hand, use links for purposes other than exploration. For example, Original 
Recipe Carl (1998) tells the story of Carl's death in an auto accident. Clicking links allows the reader 
to expand or contract the number of panels it takes from when Carl promises not to drink and drive 
until we end at his tombstone — from two panels to fifty-two panels. In essence, links change the 
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This, perhaps, points toward a more general confusion. In common discussion 
of digital literature some forms have been defined using terms that specify system 
behavior, while others have been defined in terms of user experience. For example, 
“hypertext” is specified at the level of system behavior — a text that will “branch 
or perform on request” (by links or other means). On the other hand, “interactive 
characters” is a term for those examples of digital literature that produce for users 
an experience related to interacting with a fictional/dramatic character — and how 
the system behaves while producing this experience is not specified. In fact, there 
is no reason that the experience of an interactive character could not be produced 
through a system that presents users with hypertext. However, we might have trou- 
ble discussing this work in digital literature circles, because some critics have become 
so accustomed to viewing “hypertext fiction” and “exploration-based fiction” as syn- 
onymous, even though the system behavior of hypertext does not specify that the 
user experience be exploration based.? On the other hand, we may also accept “hy- 
pertext” and “interactive characters” as the names of different forms because of the 
differing types of emphasis on process and data that are common in each form. Hy- 


pertext/media tends to create assemblages of data, and use processes to interconnect 





level of detail of the story, making it like a comics version of one of "Ted Nelson's original non- 
chunk hypertext concepts: “stretchtext.” Choose Your Own Carl (1998-2001), on the other hand, 
is a crossword-style comic (branching and recombining) on the same subject, which is composed of 
frames drawn based on the suggestions of more than a thousand readers. Here, link-clicking operates 
to reveal the original suggestions considered for each frame. The result, as McCloud puts it, is a 
“Fully Interactive, Multiple Path, Reader-Written, Death-Obsessed Comics Extravaganza.” 


This may be due, in part, to the descriptive vocabulary developed for hypertext systems. While 
Michael Joyce's distinction between “exploratory” and “constructive” hypertexts (Joyce, 1988) is 
certainly useful when trying to explain different types of hypertext systems and user positions, it's 
important to remember that hypertexts that function in what Joyce would call an “exploratory” 
manner may not have exploration as their primary user experience. 


391 


them — whereas interactive characters create smaller data elements that are arranged 


in the process of interaction. 


Interactive characters 


Of all interactive literary forms, interactive characters have the proudest early history. 
Turing's famous “test,” after all, is essentially a definition of a computer's role as that 
of an interactive character. And perhaps the most widely successful early example 
of digital literature was the interactive character Eliza/Doctor. Abelson's Goldwater 
Machine is another early example, which was mentioned in the previous chapter. 
The program Racter — often presented as an interactive character — achieved wide 
attention as the “author” of the novel The Policeman's Beard is Half Constructed 
(1984). 

Unfortunately, the attribution of that novel to Racter is now widely regarded as a 
hoax (Barger, 1993). In any case, the interactive character (or “chatterbot”) version 
of Racter could not have composed anything like the novel presented by William 
Chamberlain (author of the book's introduction, co-author of the program, and, it is 
suspected, author of the elaborate templates that actually produced the novel). And 
Racter is not alone in this deficiency. 

While work on interactive characters has certainly continued, and while their kin- 
ship with the popular understanding of Turing's test has kept a certain amount of 
attention focused on them, no work yet done with interactive characters has succeeded 
in creating anything like a novel. A textual experience that develops meaningfully 


over long engagement seems out of reach for interactive characters. Perhaps this is 
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largely architectural, given that the field has been dominated by attempts to create 
systems that perform well in local interactions, such as small bits of conversation, 
rather than over long-term engagements. However, even when chatting in MUDs and 
other online conversation spaces, wisecracking in online games, and sharing thoughts 
and information over instant messaging networks, the limitations of interactive char- 
acters become apparent quickly. 

On the other hand, there are some important exceptions. One successful area 
of work in interactive characters has been in the creation of characters that don't 
interact linguistically. For example, PF.Magic's Dogz, Catz, and Babyz allowed the 
audience to form long-term relationships that included Al-driven learning on the part 
of the interactive characters. The Japanese Tamagotchi phenomenon, on the other 
hand, was structured around a long-term experience of the interactive characters de- 
velopment. Though the possible interactions and their consequences were very simple, 
compared with PF.Magic's efforts, the highly portable hardware and extremely needy 
characters were found appealing by audiences around the world. 

Another successful area of work in interactive characters has placed these char- 
acters in larger contexts which structure the audience experience. For example, the 
basic concept of Carnegie Mellon's Oz Project (discussed briefly in Chapter 2 on page 
74) was to combine highly complex interactive characters with a structure-oriented 
“drama manager.” This allowed response-oriented character architectures to assume 
the lead in local interactions (with the audience and with other characters) while the 
system kept guiding the overall experience toward an aesthetically pleasing shape. 


Authors working in the form of interactive fiction have also created compelling char- 
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acters by using the tools and contexts of that tradition. For example, the character 
of Floyd the robot in the Infocom title Planetfall is often cited as the first successful 
non-player character in a computer game. This was accomplished by two means: 
giving Floyd witty responses to local interactions and structuring the overall game so 
that successful completion involved Floyd's unexpected (and unexpectedly moving) 
self-sacrifice. 

In fact, this particular kind of larger context for interactive characters — the 
computer game — has become the primary arena in which new kinds of interactive 
characters are being experienced by audiences. I view such games as a subset of the 


third general form 1 wish to discuss here. 


Interactive fiction, drama, and role-playing games 


When literary hyperfiction began to feel like a literary movement — in the 1980s — 
some of the writers involved adopted a slogan that may seem strange to us now: “This 
is not a game.” This slogan didn't come out of a worry that their work would be 
confused with Pac-Man, but from another set of literary projects that then loomed 
large over the field of computing. These projects, the “interactive fictions” (IFs) 
published by Infocom and a handful of other companies, were some of the bestselling 
software of the decade, with titles like Zork becoming nearly household names. 
These interactive fictions were games — one of the primary modes of interactive 
engagement was attempting to solve their puzzles, gaining points and progressing 
toward the conclusion (of both the game and the story). Of course, on some level 


these were also hypertexts. They were texts able to “branch or perform on request.” 
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The method of interaction was to type textual commands and receive variable textual 
replies. And, in fact, in the 1980s edition of Computer Lib / Dream Machines Ted 
Nelson writes under the heading “Interactive Fiction” the sentence: “This is a form 
of hypertext, since hypertext simply means nonsequential writing” (p. DM30). 

And as Nelson's sentence points out, one of the key facts about interactive fictions 
was that, despite the fact that they could be played like a game, they were primarily a 
form of writing. Nick Montfort, in his thoughtful history of the form — Twisty Little 
Passages (2003) — in fact finds the literary riddle the most productive predecessor 
with which to compare these fictions. While the processes involved in interactive 
fiction are somewhat more complex than those employed in traditional discrete hy- 
pertext — parsing human input, managing a simulated world — over the last couple 
decades interactive fiction has received much more attention from individual writers 
than from commercial companies or game designers, and most of these writers use 
one of a small number of relatively mature authoring systems. As discussed briefly in 
Chapter 2 (on page 79) this allows these writers to focus most of their attention on 
shaping each work's data to achieve desired surface results, and comparatively little 
on their works” process definitions. On the other hand, the most innovative work in 
this field has often created not only surprising textual data but also reshaped the 
standard interactive fiction processes in a manner specific to each work. 

Part of the reason that work on interactive fictions has become dominated by writ- 
ers is that IF”s commercial potential was exhausted not long after the rise of hypertext 
fiction as a literary movement. The commercial space that had once been occupied 


by interactive fictions (specifically, games played on home computing platforms) was 
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taken over by graphically-based games. Some of these, the graphical RPGs (Role- 
Playing Games), employed structures based on literary models (such as the quest) 
and could be seen as descendants of interactive fictions. But most of the games that 
came to homes in the 1980s and 1990s were, instead, descended from models of non- 
literary games and sports, with innovation focused on bringing in techniques from 
arcade games, flight simulators, movie special effects, and other driving applications 
of computer graphics. 

During the period of commercial IF"s decline an intriguing vision arose of how 
to take interactive fictions to a new level of both visual and literary engagement. 
This is the vision of “interactive drama” — first proposed in Brenda Laurel's PhD 
thesis, Toward the Design of a Computer-Based Interactive Fantasy System (1986), 
and popularized by her book Computers as Theater (1991). In this vision comput- 
ing is seen as a dramatic experience, and the principles of Aristotle's Poetics are 
used as a guide to computer interaction design. More specifically, computer games 
(including, in Laurel's dissertation, the Atari game Star Raiders) are understood in 
dramatic terms — opening up a vision of how one might move forward from the 
puzzle-solving gameplay of Infocom titles toward something more like an open-ended 
literary experience. As mentioned in the previous section, this challenge was taken 
up most notably by the Oz Project at Carnegie Mellon University (an effort led by 
Joe Bates). Michael Mateas, whose work on Terminal Time and Expressive Al were 
discussed in the previous chapter, was the last PhD student of the Oz Project. His 
collaboration with Andrew Stern (previously of PF.Magic) on the interactive drama 


Facade (2005) has been received as the first successful embodiment of the interactive 
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drama vision. 

Facade, however, lies somewhere between independent artistic production and 
academic research. Models in the commercial game industry remain much closer to 
those from Infocom's days — though writer /designers such as Jordan Mechner (Prince 
of Persia: Sands of Time, The Last Express) and Tim Schafer (Grim Fandango, Psy- 
chonauts) have led efforts toward innovative and successful combinations of gameplay 
and fiction. Meanwhile, the area of commercial computer games that lies closest to 
literary models — role-playing games — has now divided into two areas. In one, 
single-player RPGs, there is increased focus on the quality of storyline (Knights of 
the Old Republic) and the simulation-driven behavior of non-player characters (Fable). 
At the same time, there is increasing attention being given to the field of massively 
multiplayer online RPGs (MMORPGs) such as EverQuest and World of Warcraft. In 
these games the behavior of other characters is often quite intelligent — because they 
are being controlled by other human players, located at Internet-connected computers 
elsewhere in the world. On the other hand, the structure of stories must be different, 
given that the world is persistent (ongoing and changing even when a player leaves) 
and populated by many player characters. This has been addressed in a number 
of ways — ranging from networks of quests that are available repeatedly to differ- 
ent players, world-changing story arcs, and the performance-oriented interventions of 
“event teams” (which will also be discussed briefly in the next section). 

Finally, it is worth noting that, in recent years, some of the concern with games 
and play that informs the fields of interactive fiction and drama have found their way 


into the literary communities in which hypertext fiction flourished. Writers such as 
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Stuart Moulthrop (well known for his hypertext novel Victory Garden) have begun to 
produce “instrumental” or “playable” literary works (such as Moulthrop's Pax). Long 
cured of their allergic reaction to games, such authors are taking the combination of 


literary and playful engagement in intriguing directions. 


Forms and roles of computation 


Having reviewed interactive digital literature from the perspective of well-known 
forms has led us to consider, primarily, work that is interactive from the audience's 
perspective. It has also left intact most of our everyday ideas about how the field 
is organized. But we might also benefit from another view of interactive digital 
literature — one emerging from greater specificity about the forms and roles of com- 
putation involved in the works we are considering. One approach to beginning this 
effort would be to propose different distinctions and see what organizations of the 
field result — both those that run along and those that cut across the grain of our 


current intuitions. 
For example, we could distinguish (1) between 
(a) digital literary works for which computation is required only in the author- 
ing process and 
(b) those for which it is also required during the time of reception of the surface 


by the audience. 


This is a distinction familiar from this study”s first chapter. Drawing some exam- 


ples from various chapers, we can see that (a) includes Strachey's love letter generator, 


398 


computer-generated stories (such as those from Minstrel, Universe, and Brutus) and 
poetry (such as Hartman's), and any literary prints hung on the wall (e.g., at SIG- 
GRAPH). We might call it “digitally-authored literature.” Conversely, (b) includes 
animated elit (e.g., Dakota, discussed in Chapter 2, page 68) if viewed on a computer 
screen, Eliza/Doctor and all other interactive works, email novels, and any literary 
uses of virtual reality Caves, web browsers, cell phones, game consoles, and so on. 


We might call this “digital media literature.” 
A different approach would distinguish (2) between 


(a) those works in which the processes are defined in a manner that varies the 


work's behavior (randomly or otherwise) and 


(b) those that contain nothing within their process definitions that leads to 


varlation. 


This distinction, while not mentioned earlier in this study, might well be a useful 
one — casting a wider net than “interaction.” In this case (a) again includes Stra- 
chey's generator, and the story and poetry generators that were part of la (other 
than Brutus), but also all interactive works, while (b) includes Dakota, most email 
narratives, and so on. A rather different arrangement, which we might refer to as 


“computationally variable” and “computationally fixed” digital literature. 


Within category 2a we could make a further distinction (3) between those that vary 





SA topic given more attention in our coming discussion of surface. 
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(a) without input from outside the work's material and 


(b) with input from outside. 


This is another familiar distinction in this study. Here (a) includes Strachey”s 
generator, and most poem and story generators (though Terminal Time is an impor- 
tant exception, and so are two of Tale-Spin's modes), while (b) includes pieces that 
change based on the day's news, or user interaction, or other inputs. We might call 


these “batch-mode” and “interactive” variable digital literature. 


And within 3b we could distinguish yet again (4) between those that vary with input 


(a) other than from humans, aware of the work, and 


(b) from humans aware of the work. 


Interestingly, (a) includes few works of digital literature, though it is an active 
area of digital art, including works that vary with network behavior (clients for RSG's 
Carnivore), the weather (John Klima's Earth), and the stock market (Lynn Hersh- 
man's Synthia); while it is (b) that includes popular literary forms such as hypertext 
fiction (Victory Garden), interactive characters (Eliza/Doctor), and interactive fic- 
tion (Marc Blank and Dave Lebling's Zork). We might call these “environmentally” 
and “audience” interactive digital literature. It should be noted, of course, that 
this distinction (unlike others above) is not exclusive. The Impermanence Agent (on 
which Í collaborated with Brion Moss, Adam Chapman, and Duane Whitehurst) is an 


example of digital literature that is both environmentally and audience interactive.” 





"The Impermanence Agent (1998-2002) is not interactive in the sense that the audience can, 
say, click on the work. This piece launches a small browser window and tells a story of documents 
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We might visualize the results of this set of distinctions in a manner like this. 
First, our distinction based on whether digital computation is required only in the 


authoring, or also in the audience experience (seen earlier in figure 1.3, page 14): 


(1) digitally authored 
digital media 


Second, our nested set of distinctions based on computational variability, interac- 


tion, and source of interaction (the last of which is not exclusive): 


computationally fixed 
batch mode 
(2) 


audience 
environmentally 


computationally variable — (3) interactive — a 


There is the potential for these sorts of distinctions to be of use in our conver- 
sations about digital literature, especially in combination (or tension) with existing 
groupings based on perceived genre. They help us name more precisely, for example, 
how the computationally variable email messages delivered by Rob Bevan and Tim 
Wright's Online Caroline (which are part of a larger audience interactive system) dif- 
fer from those of most email narratives (which are generally computationally fixed) 


though the messages themselves are not interactive in either case.? These distinctions 





preserved and lost, of impermanence, within it. While this story is being told, the work is also 
monitoring the reader”s web browsing of other sites. Parts of sentences and images from the reader's 
browsing are progressively collaged into the story, using a variety of techniques. This results in a 
different experience for every reader — one which is environmentally interactive in that it draws its 
material primarily from websites created without the work in mind, but is audience interactive in 
that readers can choose to alter their browsing habits in order to provide the work with different 
material to consider for collage into the story (Wardrip-Fruin and Moss, 2002). 


8 Online Caroline (2000, 2001) not only sends the reader email messages — it expects a response. 
The responses don't come by writing email, but by visiting the website of the reader”s online friend, 
Caroline. At the website readers communicate with Caroline via a simulated webcam, enter details 
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may also help us understand the relationship between the body of work in digital lit- 
erature and in the broader digital arts — as well as the relationship between digital 


literature and computational systems more generally. 


Surface and Audience 


During the main body of this study, while considerations of surface were back- 
egrounded, it may have seemed that surfaces are seen, here, as relatively simple. But 
this is not the case. Surfaces are understood, in this study, to be as complex and rich 
as any other element of digital media works. Certainly a work's surface may simply 
be a block of text, but there is obviously a long and compelling tradition of inter- 
preting the complexities of blocks of text — and for good reason. Beyond this, we 
cannot consider surfaces without also considering the contexts of their presentation 
and the audiences that experience them. 

There are many possible examples. For some works we should consider the phys- 
ical arrangement and movements of the work's surfaces, as well as movements of 
the bodies of audience members (and, perhaps, the work's performers). For others 
it is useful to consider the context in which the work's surfaces are presented (e.g., 


within the windows of our everyday computing environments) and to which they 





about themselves via web forms, and experience the unfolding of a 24-part drama. Each email 
sent to a reader is a fixed block of text, but these texts are customized based on what is known 
about the reader from website visits (e.g., whether the reader has children). Also, the sequence of 
messages is not fixed. If a reader goes too long without visiting the website after the receipt of one 
of Caroline's email messages, the character will begin to send reminders and eventually break off 
the “relationship” with a message that includes the words: “I won't mail you any more. 1P'll assume 
you're away, or busy... or maybe you're just fed up with me.” 
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are connected (e.g., other portions of the Internet via web links) and through which 
they circulate. In other words, the potential complexities of surface and audience 
are immense — and for any particular work, selecting the elements that are relevant 
for our attention is part of the interpretive challenge. This section considers a few 


configurations of surface and audience that are important for particular types of work. 


Computation at the surface 


In Chapter 2, beginning on page 70, there was a short discussion of email narratives. 
This discussion pointed out that these narratives build on longstanding ambitions 
of writers to tell epistolary stories, and bring new possibilities to the form. But 
the discussion did not help us answer a potentially puzzling question: How do we 
understand the differences, rather than the similarities, between an email narrative 
such as Rob Wittig's Blue Company and Bram Stoker's Dracula? Both, after all, 
are epistolary stories. Neither is interactive or otherwise computationally variable. 
But Blue Company's letters originally arrived in one's email reader, with appropriate 
datestamps, and the timing of their arrival determined the possible timings of one's 
reading experience. Does this mean that there would be no difference between them if 
Dracula were separated into pieces and sent by post, receiving appropriate postmarks? 
No, not quite. When it was established — at the outset of this study (Chapter 1, page 
18) — that digital literature requires digital computation, understanding computation 
that is required at the surface for elements other than those defined by the author (e.g., 
audience-selected email reading programs as necessary part of surface) became one 


of the challenges. 
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This challenge is particularly apparent for email narratives, blog fictions, and 
other emerging networked forms (Jill Walker has begun interesting work on these 
and related forms under the term “distributed narrative,” forthcoming). It is related 
to a different challenge — that of “artifactual” work. An artifactual project presents 
itself as a collection of computer files or systems (perhaps mixed with physical me- 
dia) rather than as a literary work. The operations of a piece such as John McDaid's 
Uncle Buddy's Phantom Funhouse (delivered as a box of digital and non-digital ar- 
tifacts supposedly left behind by the reader's recently deceased uncle) have nothing 
to do with the network, but viewing the digital files in a computational context is an 
important part of the work's surface. Also, as McDaid explains, this way of shaping 


the surface determines which authorial techniques he sees as appropriate: 


To be precise, in artifactual hypertext, the narrator disappears into the in- 
terface, with the logic of the hypertext becoming the “narration.” Which 
is why, in cases where you are creating a fictional narrator who might 
be given to puzzles or games, such devices can be appropriate. But only 
within, and as aspects of, that narrating interface. (1994) 


Bill Bly, author of the artifactual We Descend, joims McDaid in pointing out 
that such work is not unique to digital forms (Bly, 2000). Just as we may learn 
lessons about how to interpret processes through the work of those who study process- 
oriented artists such as John Cage, we may learn techniques for approaching artifac- 
tual work from those who have interpreted print texts such as Milorad Pavic's Dic- 
tionary of the Khazars or Ursula LeGuin's Always Coming Home. At the same time, 
the fact that some digital artifactual literature is interactive — that, for example, as 


McDaid suggests, there can be puzzles that, when solved, alter the operations of the 
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work's processes — points to the limits of comparisons. Such comparisons are also 
limited by the differences (organizational, structural, and so on) between the digital 
surfaces of artifactual works and the print surfaces of codex books. 

And, of course, given the quick evolution of digital media and its platforms, the 
surfaces of many works come to seem more closely tied to their computational context 
as time goes on. Take, for example, the case of Eliza/Doctor. Our interpretations of 
this work are likely to be a bit odd if we interact with it now in a graphical window 
(e.g., as a Java applet in a web browser) and never consider the original context in 
which its surfaces were experienced. As Nick Montfort has pointed out (Montfort, 
2004), the project was developed on a system that not only wasn't a graphical screen 
— it wasn't a screen at all. Rather, the system's textual surfaces (including the 
audience replies) were printed on a continuous ream of paper fed through a teletype. 
Further, this typewritten interaction was taking place in an environment in which 
people communicated with each other through the very same textual medium (much 
as many people communicate via instant messenger clients today). This is what made 
possible the famous story of the Bolt Beranek and Newman manager conducting 
an increasingly exasperated conversation with Eliza/Doctor, believing himself to be 
communicating with a subordinate. As Janet Murray has pointed out, the spread 
of this story (in several variations, some certainly apocryphal) mirrors that of the 
Paris audience that supposedly fled the theatre when the Lumiere Brothers” film of 
an approaching train was first shown (Murray, 1997). It points to our anxiety that 
the representational power of a new medium might cause us to mistake its products 


for reality. In the world of BBS culture, where 1 first experienced Eliza/Doctor, 
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it remained in a context of predominantly textual software experiences mixed with 
human-to-human textual communication that allowed it to retain much of its original 
impact.? But to interact with Eliza/Doctor now, even if running as a bot on an 
instant-messaging network, is to experience its surfaces in a computational context 


quite substantially different from that in which it was created and first experienced. 


Bodies at the surface 


Just as the WIMP (Windows, Icons, Menus, and Pointers) interfaces of modern com- 
puters provide a surface context for much digital literature, it is also important to 
note that other digital literature presents its surfaces in utterly different contexts. 
Perhaps it will help clarify the issues if we ask ourselves another question, such as 
one first posed to me by Roberto Simanowski: How do we understand the differ- 
ence between Guillaume Apollinaire's “Il Pleut” and Camille Utterback and Romy 
Achituv's Text Rain? Apollinaire's poem is made up of letters falling down the page 
like rain. Utterback and Achituv's installation takes a video image of the audience 
standing before it and projects that image on the wall in front of the audience, with 
the addition (in the video scene) of the letters of a poem falling down like rain and 


resting on the bodies of their readers. Obviously, one difference is the passage of time 





“BBSes, or Bulletin Board Systems, were computers that accepted connections from other com- 
puters over regular phone lines, using modems. Often run by individuals as a community service 
(though there were also commercial BBSes) these machines usually had a small number of dedicated 
phone lines that allowed users to upload and download files, take part in asynchronous discussions 
on “bulletin boards,” make moves in turn-based games, exchange real-time messages with the users 
currently connected to the other phone lines, and interact with programs like Eliza/Doctor. A 
vibrant culture in the 1980s, they disappeared almost overnight as public access to the Internet 
expanded. 
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in Text Rain, and another difference is that Text Rain is audience interactive (lifting 
up a hand on which letters rest causes them to be raised as well). But, at least as 
fundamentally, another difference is that Text Rain is situated in a physical space 
other than a printed page or a computer screen, in which the method of interaction is 
the movement of the readers? bodies (which are represented within the work itself). 
Simanowski has begun to think through the issues we need to consider when inter- 
preting digital literature of this sort from a literary perspective (Simanowski, 2005), 
but the insights of disciplines such as performance studies will also be important as 
we investigate further. 

There are a number of forms of digital literature for which space and the body are 
obviously essential to our consideration of surface — including installation art such 
as Text Rain or Bill Seaman's literary installations, locative media such as /tinerant, 
dance and technology pieces such as Jamie Jewett and Thalia Field's Rest/Less, and 
literary virtual reality such as Sereen (on which 1 collaborated with Sascha Becker, 
Josh Carroll, Robert Coover, Andrew McClain, and Shawn Greenlee).!% But, as N. 
Katherine Hayles reminds us (Hayles, 2004) we're not necessarily well served by ig- 
noring the reader's body when interpreting the audience experience of the surfaces 
of other works of digital literature. It is worth attending to the ways that our bod- 


ies become trained in the unusual WIMP mousing behavior required to read Talan 





10 Rest/Less brings poetry together with dance, music, and technology. A collection of grid-shaped 
poems by Field become the space over which five dancers choreographed by Jewett move — triggering 
spoken language, bells, wind, and video images of the handwritten poems. The performance system, 
developed by Jewett, does not require sensors or tracking aids to be placed on the dancers” bodies, 
leaving them free to interact lyrically with the grid made visible to the audience on the floor of the 
performance space. Screen was discussed briefly in Chapter 4, see page 196. 


407 


Memmott's Lexia to Perplexia or in the combinations and timings of game console 
controller manipulations required to move through Jordan Mechner and Ubisoft's 


Prince of Persia: The Sands of Time. 


Audiences and networks 


There are other issues we should consider that are located with the audience, but not 
precisely at the surface. As discussed earlier, works such as Dakota could have been 
created as traditional animations, and distributed on film. The dramatic growth of 
work in such forms isn't, however, simply an outgrowth of the availability of computer 
animation tools. There is something about the network, and about the growth of 
network culture (especially online forums for amateur cultural production such as 
Albino Blacksheep) that has been important to the development of this work. And 
something about the ability to browse for and view this work in a web browser, using 
the same machine used for work, during any brief break from work. In this way the 
computational context of the surface is important to the work, but not in the same 
way as 1t is for email narratives. 

Of course there is also another kind impact that audiences can have on the surface, 
operating in an online space. A third question might help bring some of these issues 
to the fore: How do we understand the difference between an interactive fiction such 
as Zork and a MUD or MOO? An interactive fiction is a textually-described world 
which one can move through by typing commands: investigating spaces, acquiring 
objects, and interacting with characters. MUDs and MOOs share all these charac- 


teristics with interactive fictions — the primary difference, for a first time visitor, 
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is that the characters in the space are often real people (other visitors, experienced 
participants, and even those involved in constructing the world). That is to say, 
important elements of the surface are produced by other audience members, rather 
than by the work operating in isolation. Torill Mortensen is one of the writers who 
has been thinking seriously about the pleasures of experiencing these textual worlds 
with other players, as their surfaces and data are shaped through time by the actions 
of other players (Mortensen, 2004). In a related vein, TL Taylor has been writing 
about graphical environments such as EverQuest in a manner that foregrounds how 
interactions within the simulated world are shaped by networks of relation “outside” 
of it (Taylor and Jakobsson, 2003). Work of this sort is necessary if we are going to 
understand player experiences, and the context in which the performative narrative 
interventions of “event teams” take place in worlds such as KverQuest and Star Wars 
Galaxies, as well as related forms such as alternate reality games (which, like Elan Lee 
and Sean Stewart's The Beast, often involve elaborate plots and puzzles, hundreds of 


documents, and thousands of simultaneous reader /players in communication). 


Final Thoughts 


The remarks above are, of course, quite preliminary. They appear more as a gesture 
toward what I will be thinking about in the future than as a set of considered conclu- 
sions. The main focus of this study, of course, has been a consideration of expressive 
processing in digital works. 


For me, this focus began when I was young. Í grew up with computers. There 
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were big computers at my mother's university lab, and by the 1980s my father (also 
an academic) wrote freelance for publications like Infoworld. He brought home a 
Kaypro and an Osborne. My babysitter's son Brion, who became my close friend, 
had been exposed to computers from an even earlier age by a father who worked for 
Ford Aerospace. 

I remember that Brion had figured out how to open, edit, and re-save the files for 
some of the computer games we played. There was much trial and error involved, 
as we tried to figure out what could be changed without breaking the system. As it 
turned out, there wasn't much we could do. We had a great time re-writing game text 
and changing the colors in which things displayed, but we couldn't do anything to 
alter the way the games were played — we couldn't alter their processes. Eventually 
the satisfaction of providing new text became minimal, given it was surrounded by 
a context of complex operations we couldn't touch. Our contributions could only be 
comments on the games, never rewrite their mechanics toward a new expression. 

While 1 programmed in Basic and Logo as an elementary school student, my 
projects were largely graphical. It wasn't until high school that I began to seriously 
pursue some combination of text with processes. As with many before me, my first 
concentrated undertaking was a poetry generator (written in Turbo Pascal) which 
I used to produce reams of poetry. Ray Kurzweil hadn't yet begun to practice his 
poetry-based “sort of” Turing test, but I did pass some of my generator's better 
products to friends who turned them in to their teachers — who provided helpful 
comments before being informed of the experiment. I spent an inordinate amount of 


time on such things. 


410 


The obsession continued into college, where I discovered Ted Nelson's Computer 
Lib / Dream Machines and — not long after — the George Landow's recently- 
published Hypertext. Luckily I was at a college that let me define my own studies, 
and Í was set on an artistic and scholarly path from which I haven't deviated. Nor 
do I expect to deviate from it. The richness of the work that has already been done 
in expressive processing still, 1 believe, represents only the beginning. I, for one, plan 
to keep up the exploration. 

In another context I hope to write more from my perspective as an author of digital 
literature, but for now let me conclude by making explicit what (no doubt) a number 
of readers have likely noticed just beneath the surface of this study. There is a poetics 
that shapes my perspective. To put it briefly, it focuses on the importance of the fit 
between a work's processes, its other elements, and its context. For example, to return 
to Eliza/Doctor, this work's name has two parts because they indicate two different 
parts of the system. Weizenbaum's Eliza system was a general system for research 
on computer conversation (Weizenbaum, 1976) — a set of processes. There were a 
number of scripts for Eliza, but only one, called Doctor, became widespread. This 
script — this data — was a particularly good match for the Eliza system because the 
evasive maneuvers of a parodied Rogerian therapist fit well into what the relatively 
primitive natural-language processes of Eliza were capable of supporting. And, as 
has already been explained above, this bundle of data and processes supported a 
mode of interaction that was a good fit with Eliza/Doctor's context. Unlike Tale- 
Spin, which has generally been taken to have less going on than is actually the 


case, Eliza/Doctor's well-matched process and data in context have inspired great 
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imagination on the part of its audiences and given rise to speculations much more 
complex than its actual operations. 1 can only hope that this study will help inspire 


works of similar imagination. 
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